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Preface 


This volume provides an introduction to the physics of beams. This field 
touches many other areas of physics, engineering and the sciences, and in 
turn benefits from numerous techniques also used in other disciplines. In 
general terms, beams describe ensembles of particles with initial conditions 
similar enough to be treated together as a group, so that the motion is a 
weakly nonlinear perturbation of that of a chosen reference particle. 

Applications of particle beams are very wide, including electron micro- 
scopes, particle spectrometers, medical irradiation facilities, powerful light 
sources, astrophysics — to name a few — and reach all the way to the largest 
scientific instruments built by man, namely, large colliders like LHC at CERN. 

'The text is based on lectures given at Michigan State University's Depart- 
ment of Physics and Astronomy, the online VUBeam program, the US Particle 
Accelerator School, the CERN Academic Training Programme, and various 
other venues. Selected additional material is included to round out the pre- 
sentation and cover other significant topics. 

'The material is at a level to be accessible to students of physics, mathemat- 
ics and engineering at the beginning graduate or upper division undergraduate 
level and can be viewed as an introductory companion to the more advanced 
Modern Map Methods in Particle Beam Physics by M. B., published by Aca- 
demic Press. Emphasis has been placed on showing major concepts in their 
original incarnations and through historic figures. Finally, some of the sec- 
tions and chapters that contain more advanced material are marked by a * 
symbol and can be omitted in a first reading. 

Many organizations and individuals have helped directly and indirectly at 
various stages in the development of this book. MSU's Physics and Astronomy 
Department provided an environment of support for this and other books, the 
VUBeam program, as well as many of our other activities. 

For two decades of continuous financial support that were instrumental 
to the success of the book, the VUBeam program, and indeed much of our 
research, we are grateful to the US Department of Energy, and in particular to 
Dr. Dave Sutter, the long-term coordinator of their beam physics activities. 

K. M. would like to thank her daughter Kazuko for her own great interest 
in physics and science and much encouragement during the finalization of this 
text. 

W. W. would like to thank Dr. D. Robin for his encouragement, Dr. E. 
Forest for stimulating discussions on various aspects of beam dynamics such 
as normal form theory, and his wife Juxiang Teng for her unwavering support 
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He Zhang for thoughtful comments about the material. We also are thankful 
to many authors, national laboratories and publishers allowing us to repro- 
duce published figures. The details are described in the corresponding figure 
captions. 

Last but not least, we are very grateful to the entire staff of Taylor & 
Francis for their continuous support, in particular to Francesca McGowan for 
her great interest and productive comments. 
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Chapter 1 


Beams and Beam Physics 


In this chapter we will lay the foundations of basic concepts about beam 
physics, and discuss various important mechanisms of production and accel- 
eration of beams. Because of the breadth of the material and the multitude 
of existing devices for each of the mechanisms, we will focus only on key con- 
cepts, and introduce them through the eyes of their inventors by using their 
original historical drawings, with only minor adjustments for uniformity of 
style and technical clarity. 


1.1 What Is Beam Physics? 


The field of beam physics deals with motion of ensembles of particles 
(usually charged) in electromagnetic fields. It is called beam physics due 
to the fact that, in most cases, those particles have similar coordinates, 
which is the rough definition of a beam. In many cases, the positions and 
momenta of the particles are sufficient to describe their motion. In this 
case, the particles are described by a state vector consisting of positions and 
momenta 


Z = (£, Pr, Y, Py, Z, Pz) i: 


In other cases, additional coordinates may be needed; typical examples in- 
clude the mass, sometimes the charge, or the spin vector and the related 
magnetic moment and possibly electric moment of the particle. 

An ensemble of particles with such similar coordinates is called a beam 
(see Fig. 1.1), and the sub-fields concerned with the study of such beams is 
called beam physics. There are other fields of physics that can be described 
in very similar terms and language, some of the most notable examples being 
light optics and astrodynamics. There are also other sub-fields of physics 
dealing with the study of the motion of such ensembles of particles; important 
examples are plasma physics and the dynamics of galaxies. These fields 
are different from beam physics in that in their cases, the particles usually do 
not have rather similar coordinates but occupy larger regions. 

The space of state vectors Z is often called phase space, and a coordinate 
system showing Z is often called a phase space diagram. The volume of 
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Px 


Ensemble 
of particles 


FIGURE 1.1: A beam — an ensemble of particles in the vicinity of a 
reference particle with phase space coordinate Zo. 


the cloud of particles in phase space has a special name. It is called emit- 
tance. As we shall see later, in many systems the emittance is conserved and 
hence plays a special role. 

Because all particles are close together, it is often useful to pick one of 
these particles, typically one that is somewhere in the middle, and describe 
the motion of the others relative to this reference particle. So if the 
reference particle has coordinates Žo, then the motion of the particles would 
be described in the relative coordinates AZ = Z — Zu 

In many cases, the density of particles is so low that their interaction can 
be neglected or expressed by simple collective models. In other cases, it is 
necessary to include the study of the self-interaction, i.e., we have to take into 
account the fields due to the space charge. 

If the fields are electromagnetic, then the motion is described by the Lorentz 
force law, which in SI units is 


dp + E 
Z =a(B+exB). (1.1) 
Here E and B are the electric and magnetic fields, respectively. These fields 
are connected to the scalar potential V and the vector potential A via the 
relations 


Although this may not be directly relevant in this book, we want to note here 
for the sake of completeness that the equations of motion in the form of the 
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Lorentz force law can also be obtained from the Lagrangian 


A N qe Ü. A — qV; (1.2) 
S c q q , a 


refer to eqs. (1.85), (1.145) in [5], and, for example, [29]. From this La- 
grangian, one can also obtain a Hamiltonian of the motion in a procedure 
that is standard for all Lagrangian systems. One begins by defining the canon- 
ical momentum as: 


"OE OL 
Pecan d Ov’ 
which here has the form 
Dean = ym + qA = Payn + qå, (1.3) 
where 
1 


vr 
and the canonical momentum Pean is different from the relativistic dynamical 
momentum 
Dayn = yw. (1.4) 
The Hamiltonian of the motion can then be found as 


H = Pean U— L. 


This expression initially contains both Pean and @, and it is necessary to elim- 
inate y and express it in terms of Pean. Because Payn = ym and pays = 
Pcan E qA from eq. (1.3), ym = muU/ V l= v2/c? = Dean = qA, leading to 
m^? (1 = vc) = (Dean = qA)?, so we find 

Dean — qÅ 

Pcan q Pam (1.5) 


A? | p2 2 
(Bean E aA) + nées nt ue 7 


where the expression in terms of pay; is listed as well. Using this, y1 — v?/c? 
in the Lagrangian L in eq. (1.2) is expressed in terms of Pean, also in terms 


of Bayn, as 
I mee mc 
L———————. (1.6) 
pi m2? 
(Bean E aA) + m2c2 dyn 
Thus, we obtain for the Hamiltonian 


ae 
H-c- (Bean — 24) + mc? +qV; 
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refer to eq. (1.149) in [5] for details of the derivation. 

When studying the evolution of the beam from the time it is born until it is 
used, there are usually four steps involved. First, there must be a way for the 
production of the beam, and for the sake of efficiency if possible in such a 
way that its emittance is small. Second, in most cases the energy of the beam 
has to be increased; there has to be a mechanism of acceleration. Because 
of the outstanding importance of this process, the whole field is often called 
accelerator physics. Then it is necessary to transport the beam to where 
it is being used. And finally, there is often a need for storage of the beam 
for use at a later time or reuse. Lastly, often there is a need for analysis of 
the beam, in particular after the beam has been used for its purpose, which 
frequently is the facilitation of certain nuclear or high energy reactions. The 
field of beam physics spans all these steps, and each of the steps has it own 
unique problems to be solved. 


1.2 Production of Beams 


The mechanisms used for the production of the beam depend very much 
on the particular kind of particles and the characteristics of the beam that 
is needed, and they include mechanisms from a variety of different fields in- 
cluding thermal, electrical, atomic, nuclear, and even high energy physics 
processes. Common beams consist of electrons, protons, or H~, and some 
of the beams produced through nuclear and high energy physics processes in- 
clude positrons, antiprotons, pions, kaons and radioactive nuclei. Overall, due 
to the diversity of the species of the particles and the required properties of 
the beam, there are dozens of different ways of producing various beams. We 
here restrict ourselves to some of the source types that are most commonly 
used in particle accelerators and electron microscopes. 


1.2.1 Electron Sources 


Electrons exist in abundance in metals, and forming them into beams re- 
quires their extraction from the metal, called the cathode. For this the 
electrons need to overcome the potential barrier, i.e., the work function, 
at the boundary between the metal and the environment. The work func- 
tion usually ranges from a fraction of an electron Volt (eV) to a few electron 
Volts; for comparison, the average kinetic energy of gas molecules at room 
temperature amounts to approximately 1/40 eV. This can be achieved by ei- 
ther supplying additional energy to the electrons so that they can leave the 
material, or by lowering the work function. In the following we discuss some 
common approaches based on these methods. 
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FIGURE 1.2: Sketch of an early thermionic emission electron source. 
(Reprinted with permission from J. R. Pierce, J. of Appl. Phys., 11:548, 
1940 [57]. Copyright 1940, AIP Publishing LLC.) 


The first of these processes is thermionic emission. By heating a piece 
of metal to temperatures exceeding around 1000°C, a small fraction of the 
electrons will achieve energies exceeding the work function and can thus leave 
the metal. This type of source is usually called the thermionic gun. Once 
outside the metal, the electrons can be pulled away further by the application 
of strong electric fields, the distribution of which is adjusted to achieve high 
gradient and optimal focusing. An example of such a device is shown Fig. 1.2. 
Here the number of electrons available is determined by the temperature of 
the donor metal or cathode, and only those electrons in the tail of the Fermi- 
Dirac distribution above the work function can be extracted. This process is 
quantitatively described by the Richardson-Dushman equation 


4 ke 
J= ( T 2) T?e-WikaT. (1.7) 


where J is the current density, e is the charge, m is the mass, kg is the 
Boltzmann constant, h is the Planck constant and T'is the temperature of 
the cathode. It is obtained from the third law of thermodynamics and char- 
acterizes an idealized situation of a sufficiently large piece of cathode material 
to avoid quantum mechanical influences, and the absence of electric fields 
influencing extraction. 

In practice the extracted current is also greatly affected by any electric field 
applied to the cathode. This is the result of the Coulomb repulsion among the 
extracted electrons, where electrons extracted earlier can push those extracted 
later back into the cathode. When the electric field at the surface vanishes, no 
more electrons will be extracted. The relation between the maximum current 
density and the applied electric field, for a parallel flat cathode and a matching 
anode, is the Child-Langmuir Law 


4 2e 1/2 V3? 
J= 9° (=) d ; (1.8) 
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FIGURE 1.3: Left: Sketch of one of the earliest electron sources using 
point cathodes. (From Y. Sasaki and S. Maruse, Uber die Arbeitsweise und 
die elektronenoptischen Eigenschaften der Spitzenkathode, in G. Móllenstedt, 
H. Niehrs, and E. Ruska, eds., Physikalisch- Technischer Teil, 1:9, Springer- 
Verlag, 1960, (O Springer-Verlag Berlin Heidelberg 1960 [61]. Abb. 3, "Zwei 
Anordnungen der Elektrodensysteme für die Spitzenkathode." With kind per- 
mission from Springer Science and Business Media.) Right: The potential 
(dashed) and the field (solid) distribution near the cathode tip. 


where J is the current density, eo is the dielectric constant in vacuum, e is the 
charge, m is the mass, Vo is the applied voltage between the cathode and the 
anode, and d is the distance between the cathode and the anode. In practice 
the situation is more involved, and the maximum current density is usually 
the smaller of the two quantities. For flat thermionic cathodes, usually eq. 
(1.7) sets the limit for the extracted current. 

Due to the high operating temperature, the energy spread of the extracted 
electron beam is relatively large. Nonetheless, the thermionic gun is simple 
and reliable, and hence is still widely used as the source for many devices 
where the large energy spread of the electrons at the cathode is not limiting 
the performance of the machine. One significant example are circular electron 
accelerators, where the ultimate energy spread in the beam is dominated by 
other processes including synchrotron radiation discussed later. 

'The second process to produce electrons is field emission. In this mech- 
anism, a sharp needle is brought into strong external electric fields. This 
type of source is usually called the field emission gun. Because the needle 
is a conductor, it acts as an equipotential surface, and thus produces very 
strong electric fields near its tip. In practice the radius of curvature at the tip 
often ranges from below 1 nm to nearly the range of single atoms to 1 um, 
and one locally obtains a very strong field ranging from 1 to 3 GV/m. These 
strong electric fields acting near the surface lower the work function and si- 
multaneously reduce the width of the potential barrier, which allows electrons 
to escape through the well through quantum tunneling. All these electrons 
emerge from a small area. Furthermore, due to the fact that the electrons 
are diverging near the surface of the needle, the actual source is quite a bit 
smaller than the emitting area. Meanwhile, for low current, their spread in 
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momenta is rather small as well. Fig. 1.3 shows the basic principle. 

The small emitting area imposes a severe limit on the total current ex- 
tracted, but it is the standard electron source for transmission electron mi- 
croscopes discussed in more detail below. Here the current need is low, but 
the origination from a small area and with small energy spread amounting to 
what is called high brightness is very useful. The use of sharp needles (point 
filaments as they were historically called) was pioneered in the 1950s in order 
to increase brightness of the electron beam in microscopes. 

Particularly fruitful is combining needle geometries, resulting in an effective 
lowering of the work function due to stronger electric field at surface, with 
heating, which results in an increase of electrons of higher energy than the 
work function that can thus traverse it. This kind of thermionic emission with 
significant external field is called Schottky emission. In the 1960s, a new 
generation of cathodes (mainly ZrO/W, which is a tungsten tip covered with a 
thin layer of ZrO) were developed with stronger field at the surface, where field 
emission plays a significant role and complements Schottky emission. This 
regime is called the extended Schottky emission. This kind of emitters 
are the main sources of electrons for electron microscopes. The emitters that 
produce electrons through only the field emission process, called cold field 
emission gun (CFEG), have been studied since the 1970s, and recently 
have been able to produce high brightness beams. 

The third process to produce electrons is photoemission where electrons 
are produced via the photo effect. Exposing a surface to a large flux of 
photons leads to some of the photons being absorbed by electrons within the 
material, which consequently increase their kinetic energy. This additional 
kinetic energy acts very similar to intense local heating in the thermionic gun 
and leads to electrons with energies exceeding the work function and which 
can consequentially escape. In certain cases photo-energized electrons can also 
leave the material directly without colliding with other electrons, in a ballistic 
process. In practice, photons are supplied through laser pulses, of which the 
intensity, spot size and duration are relatively easy to adjust. This leads to 
the photocathode gun. This approach has greatly facilitated the advance 
of free electron lasers (FELs) in the last few decades and is an important 
component in efforts of time-resolved spectroscopy and microscopy. 

Again the extracted current is limited by the Child-Langmuir law (1.8), 
despite the fact that an intense laser pulse can often produce large numbers of 
electrons. Different classes of materials have been developed for the cathode, 
including GaAs that has been cesiated, i.e., covered by less than a mono-layer 
of cesium. This has allowed the production of electrons with energy spreads 
down to fractions of 107! eV, which is the range of thermodynamic energies 
encountered at room temperature. 

Other materials such as copper are able to withstanding the harsh environ- 
ment of a radio frequency (RF) gun, where very high extraction fields can 
be produced that significantly exceed those of the electrostatic case. Fig. 1.4 
shows the layout of an RF gun and the field distribution. It consists of roughly 
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FIGURE 1.4: Layout (left) and field distribution (right) of the RF gun at 
the Linac Coherent Light Source (LCLS). (From J. Arthur, et al., SLAC-R- 
593, 2002 [2]. Courtesy SLAC National Accelerator Laboratory.) 


1.5 cells (1.6 for this example) with the field in the two cells in opposite phase 
to ensure that electrons emitted into high extracting field end up with high 
energy and low emittance at the exit. In addition, materials such as cesiated 
GaAs with strained lattice can also produce polarized electrons, which has 
been used at SLAC National Accelerator Laboratory, California, USA, for 
high energy physics experiments on the one hand and in spin polarized low 
energy electron microscope (LEEM) on the other. 


1.2.2 Proton Sources 


In most proton accelerators, the protons are produced through stripping 
the two electrons in a negative hydrogen ion H^ at the entrance, mostly by 
passing the ion through a thin carbon foil, although lasers have also been ex- 
perimented with recently. In some applications, such as medical accelerators, 
Hj ions instead are produced and accelerated. 

Although the stripping process makes the technology a little more com- 
plicated, it has one distinct advantage over injecting protons directly. For 
injection into circular accelerators over many turns, the protons generated 
through stripping can easily be aligned with earlier produced protons already 
in the accelerator, because before stripping, both types of particles follow dif- 
ferent paths due to the differing charge. As a result, the density of protons in 
the beam can increase more and more over an extended injection period. On 
the other hand, protons injected directly require much more care and can only 
be injected with positions and directions that are different from those of other 
protons already in the ring, since because of time reversal, all protons with the 
same position and direction must have followed the same earlier trajectory. 

The H- ions can be produced by a variety of different methods. Here, only 
the surface plasma source of the magnetron type is discussed. It obtained 
its name due to the fact that it is similar in configuration to the ubiquitous 
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FIGURE 1.5: Drawing of a surface plasma source of the magnetron geome- 
try. (From J. Ishikawa, Negative ion sources, in I. G. Brown, ed., The Physics 
and Technology of Ion Sources, 2nd ed. [31]. Copyright (c) 2005 Wiley-VCH. 
With permission.) 


microwave source called the magnetron, which is used in common appliances 
such as the microwave oven. The main process of producing the negative 
hydrogen ions is through electron capture of neutral hydrogen atoms on the 
cathode surface, where electrons penetrate the potential barrier through quan- 
tum tunneling. The presence of the electric and magnetic field creates a dense 
plasma near the surface of the cathode where ions, most of which are positive, 
are produced. Those positive ions and neutral particles bombard the cathode 
partially covered with cesium and H^ ions are produced. The presence of ce- 
sium lowers the work function and greatly increases the probability of barrier 
penetration and hence H^ production. 

Some of the produced H^ ions are neutralized shortly after production. 
An electric field between the cathode and the anode is used to accelerate 
the remaining H^ ions towards the exit. However, electrons are accelerated 
towards the anode as well. But since electrons are much lighter than the H^ 
ions, they are bent much more easily than the H^ ions, and are absorbed by 
the electron collector (Fig. 1.5). 

The magnetron type of negative hydrogen ion source was first developed 
in the former Soviet Union in the early 1970s and quickly spread to Europe 
and the United States. It has become the main choice for high energy proton 
accelerators. 


1.2.3 Ion Sources 


There are a large variety of different sources for ions which found applica- 
tions in many fields. Presenting even a brief overview is already beyond the 
scope of this book; for more comprehensive reviews, see [77, 8]. As in the 
previous subsection, we limit our discussion to one important kind, which is 
the electron cyclotron resonance (ECR) ion source. Fig. 1.6 shows the 
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mechanism schematically. The chamber on the left holds the plasma where 
ions are produced. First, the gas of the element of interest is injected to 
the chamber. Collisions among the atom generates electrons and ions. The 
magnetic field (see top part of Fig. 1.6) forms a magnetic mirror that con- 
fines the ions and the electrons. For the example shown, this time is around 
100 us, and for more modern and advance versions for around 10 ms. High 
frequency (2.45 to 28 GHz) microwaves are injected into the chamber and 
electrons with rotation frequency matching that of the microwaves are accel- 
erated to between 1 and 20 keV. This process of heating up the electron gas is 
called electron cyclotron resonance heating. The resulting hot electrons 
collide with ions and neutral atoms and generate more ions. Furthermore, 
through step-by-step ionization, even multiply charged ions (e.g., Xe?5*) can 
be produced. 

In order to produce sufficient quantities of multiply charged ions, the ion 
confinement has to relatively long (~ 10 ms). The major improvement in this 
aspect is the addition of a sextupole magnet, which ensures that the magnetic 
field at the center of the chamber is at the minimum, which prevents the ions 
from drifting to the side wall. Another consequence of this configuration of 
the magnetic field is that the surface on which electron cyclotron resonance 
heating takes place is now closed, which significantly reduces hot electron 
loss. T'his configuration of the magnetic field also makes ion production more 
efficient, since electrons are confined longer and can collide with ions and 
be reheated many times. The fact that the ECR ion source does not use a 
cathode to generate electrons makes it a much more reliable source compared 
to other varieties. 

'The ECR ion source was first developed in the mid-1960s and, by the mid- 
1970s, many had been built around the world. It is probably the best source to 
produce multiply charged ions and has become the main choice of ion sources 
for nuclear physics facilities. 


1.3 Acceleration of Beams 


We now assume that an ensemble of particles occupying a small volume of 
phase space has been created, and we thus have what is called a beam. In 
many if not most of the practical cases, the energy that the beam has after 
being produced by the source is not sufficient for the purpose it is to be used 
for, which frequently amounts to furnishing the energy necessary for atomic, 
nuclear, or particle processes of interest. 

In most cases, the motion is best studied by first considering the motion 
of the reference particle, and once this motion is understood satisfactorily, 
to study the relative motion of the other particles. For a simple analysis 
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FIGURE 1.6: Layout of the first electron cyclotron resonance (ECR) ion 
source that produced multiple charged ions. (Reprinted with permission from 
R. Geller, Appl. Phys. Lett., 16:401, 1970 [28]. Copyright 1970, AIP Publish- 
ing LLC.) 


of the relative motion, often a linear approximation with all the resulting 
simplifications is possible, but frequently a full understanding of the motion 
can only be achieved by considering the nonlinear effects. 

Considering the special shape of the Lorentz force law (see eq. (1.1)), 
since & x B is perpendicular to the velocity v, it is apparent that magnetic 
fields cannot be used for purposes of acceleration, which requires forces in 
the direction of the particle. Thus any acceleration has to be provided by 
electric fields. However, as we shall see, also magnetic fields have very good 
use in particle accelerators, as they can be employed to guide the beam to 
where it is needed. In particular, in the process of acceleration they are often 
used to guide the beam through the same region of electric field repeatedly 
and thus allow the device to maximize the use of the electric fields. Indeed, 
for this purpose of guiding the beam, magnetic fields are usually even better 
suited than electric fields. T'his is because for the high velocities that beams 
usually have after even modest acceleration, the forces that can be attained 
with technologically available magnetic fields far exceed those that can be 
achieved with the respective electric fields. 

Very generally, the amount of energy K a particle gains while traveling 
from time tı to time tz in an electric field E(7,t) that depends on position 
and time is given by the path integral 


m af Bw.» a(t) dt, 
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FIGURE 1.7: The general principle of the Cockcroft- Walton generator. 
(From J. D. Cockcroft and E. T. S. Walton, Proc. Royal Soc. London, A, 
136:619, 1932 [14]. With permission.) 


where 7(t) is the particle's position as a function of time and g(t) its velocity. 
In the special case that E is time independent and hence can be written 
in terms of a potential via E- -VV, this path integral reduces in a natural 
way to the difference in potential as 


K —q-(V() - V(r2). 


'This simple fact implies a very important consequence for the design of electric 
accelerating fields: if there is to be any chance to utilize the same electric field 
repeatedly for the purpose of acceleration, then the electric field has to 
be time dependent, because otherwise repeated passing just results in a 
periodic increase and decrease of energy. In fact, the attempt to build an 
accelerator trying to increase energy repeatedly by flying through the same 
time independent field is tantamount to the attempt to build a perpetual 
motion machine. 


1.3.1 Electrostatic Accelerators 


The first class of important accelerators are those based on static electric 
fields. The kind that grew directly out of the area of electric circuit is the 
voltage multiplier that is now known under the name Cockcroft-Walton 
generator. It consists of a simple but clever circuit made of diodes and ca- 
pacitors, forming the voltage multiplier ladder. Each time a base voltage is 
applied, the first capacitor is filled with charge. Each time the voltage is re- 
moved, the first capacitor passes on some of its charge to the second capacitor, 
which in turn feeds the third capacitor, and so on. Depending on the number 
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FIGURE 1.8: Design sketch of the Van de Graaff high voltage generator. 
(From R. J. Van de Graaff, US Patent 1,991,236, 1931 [18].) 


of capacitors and the number of cycles applied, quite high voltages can be 
obtained very easily. 

For small applications it is possible to simply apply alternating current 
(AC) at the feeding end. For larger applications it may require a longer 
time to transfer the charges from the lower to the higher capacitors, and 
the change in input voltage is achieved through mechanical switches. Fig. 1.7 
illustrates “the general principle underlying the method adopted,” and is from 
the original paper on the matter [14]. Many high energy proton accelerators 
use Cockcroft-Walton generators as the first stage of acceleration. 

Another method to obtain high voltages for electrostatic accelerators is the 
Van de Graaff accelerator [20] and several similar devices derived from it are 
the main representatives of the class of accelerators utilizing time independent 
fields. The voltage difference that the particles travel through is obtained with 
a Van de Graaff generator, which consists of an endless non conducting 
belt onto which charge is sprayed from a tip via field emission, and which is 
then transported to the inside of a hollow metal sphere where it is deposited. 

Since any charge on a conducting object accumulate on the outside and 
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FIGURE 1.9: Design sketch of the use of the Van de Graaff generator as 
a particle accelerator. (From R. J. Van de Graaff, US Patent 1,991,236, 1931 
[18].) 


create a field-free interior, a new charge can be brought in from the belt on 
the inside of the sphere without experiencing any opposing fields, and thus 
large amount of charges can be accumulated on the sphere, resulting in very 
high potentials. The mechanism of the Van de Graaff generator is shown in 
the left part of Fig. 1.8 and the complete machine is shown in Fig. 1.9. The 
sketches are from the original patent on the device [18]; see also [20]. 

In passing it is worthwhile to remark that while the newly added charge 
does not experience a field when moving from the belt to the inside of the 
sphere, it certainly experiences a field while approaching the sphere and being 
attached to the belt. Thus the potential energy contained on the charged 
sphere does not come for free. It is generated through the mechanical work 
that is necessary to move the belt and the attached charges toward the sphere. 

The charged sphere is connected to a metal enclosure containing the ion 
source, thus elevating the source to the potential of the charged sphere, which 
can then be utilized for the acceleration of the particles. 

The main practical limitation of the Van de Graaff accelerator is the ne- 
cessity to prevent sparks. This is achieved on the one hand by sheer size, 
because at the same potential difference, larger size means less electric field 
strength. On the other hand, it is important to inhibit the spark forma- 
tion process. Microscopically, sparks form in a gas when small numbers of 
charged particles have a mean free path length that is long enough so they can 
attain energies sufficient to ionize other particles upon collision, resulting in 
an avalanche. This can be avoided by choosing inert gases like He (helium) 
or SF¢ (sulfur hexafluoride), and on the other hand applying high pressure to 
reduce the mean free path length. 
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FIGURE 1.10: The principle of the tandem Van de Graaff accelerator. 
(Reprinted from Nucl. Instrum. Methods, v. 8, R. J. Van de Graaff, Tandem 
electrostatic accelerators, p. 195-202, Copyright (1960), with permission from 
Elsevier [19].) 


'The Van de Graaff accelerator has several desirable features; for example, it 
can produce a fully continuous beam (often denoted by the term “cw” for con- 
tinuous wave) and at high beam current. Its main limitation is the relatively 
low energies that it can produce, which seldom exceed about 20 MeV. 

The tandem Van de Graaff is an efficient modification of the Van de Graaff 
concept, in which both the source and the target are kept at ground potential 
and which can efficiently increase the energy that can be obtained. For 
this purpose, a source is chosen that produces negatively charged ions, which 
are then sent through a regular Van de Graaff. At the end of the accelerat- 
ing section, the ions are sent through a thin foil, in which many of them are 
stripped of some of their electrons, resulting in positive ions. Because the 
particles already have substantial energy when hitting the foil, often much 
higher charge states can be produced than in the ion source itself. These 
positive ions are then sent through a second stage Van de Graaff, which is 
essentially a reversion of the first stage. At the location of the target, depend- 
ing on their charge state after stripping, their energy is increased by a factor 
of two or more. The mechanism of the tandem Van de Graaff accelerator is 
shown in Fig. 1.10. Having very similar characteristics to the original Van de 
Graaff, the energies that can be achieved in this way are in the range of up 
to 60 MeV. 


1.3.2 Linear Accelerators 


It is an important observation that the field strength that can be obtained 
in quickly oscillating (radio frequency, RF) electric fields can be sub- 
stantially higher than what can be obtained statically in devices of similar 
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FIGURE 1.11: Sketch of the principle of the linear accelerator of the 
Wideroe type. (From R. Widerée, Uber ein neues Prinzip zur Herstellung ho- 
her Spannungen, Archiv ftir Elektrotechnik, 21:387, 1928, © Springer-Verlag 
Berlin Heidelberg 1928 [73]. Bild 1, “Prinzip der Spannungstransformation 
mit Potentialfeldern.” With kind permission from Springer Science and Busi- 
ness Media.) 


size. This is mostly due to reduced presence of spark formation, because the 
formation of an avalanche of charged particles requires time scales that are 
usually larger than the time the field is in one phase. 

The use of an oscillating field, however, immediately entails that only half 
of the cycle can be used for acceleration, and thus is different from static ac- 
celerators, as the resulting beams always have a temporal micro-structure, 
also called bunched. In practical use, usually several RF resonators are used 
sequentially, each one of which accelerating the particles, and it is very impor- 
tant that the phase relationship between the individual accelerating sections 
is correct. This is usually achieved by applying the fields between the edges 
of adjacent conducting tubes. This kind of device is called the linear accel- 
erator or linac. 

The concept of the earliest linacs is schematically shown in Figs. 1.11 and 
1.12. From the wiring scheme, it is clear that the electric field in adjacent 
gaps points to opposite directions. In order to ensure that charged particles 
are accelerated in every gap, the lengths of the tubes are chosen in such a 
way that the time the particles require to fly through them equals one half of 
the RF period. So the length L; of the ith tube has to be chosen so that it 


satisfies 
1 


Li = zvili 

pct 
where T,¢ is the period of the RF frequency. Apparently this leads to a system 
of tubes of increasing length, ie. Lı < Lə < L3 < .... The exact lengths 


Li, of course, depend on the relationship between the kinds of particles and 
the values of the accelerating voltages, and so often these designs are rather 
customized geometries. Since metal wires are used to connect the tubes, the 
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FIGURE 1.12: Illustration of a linear accelerator, designed by E. O. 
Lawrence and H. D. Sloan. (Reprinted the middle picture of Fig. 1 with 
permission from [64] as follows: D. H. Sloan and E. O. Lawrence, Phys. Rev., 
38, 2021, 1931. Copyright (1931) by the American Physical Society.) 


frequency of the oscillating field is limited to below 100 MHz due to increased 
radiation at high frequency. This in turn imposes an upper limit on the 
velocity of the particles (8 = v/c < 0.03) due to practical limit of the length 
of the tubes. Meanwhile, use of wires at low frequency can significantly reduce 
the size of the accelerating structure compared to a closed structure, called 
RF cavity. 

Another type of linac, called the Alvarez linac, developed in the late 1940s 
is shown in Fig. 1.13. It uses closed structures, the RF cavities, to increase the 
oscillating frequency and reduce the length of the tubes. Another difference 
is that the field in adjacent gaps points to the same direction. As a result, 
the length L; of the ith tube has to be chosen so that it satisfies 


Li = Vil. 


Apparently, the frequency has to be at least twice of the Wideróe type to 
reduce the tube length. In practice, the frequency of the Alvarez linac is 
roughly an order of magnitude higher than that of the Wideróe type. 

Fig. 1.13 is remarkably informative of the physics of the accelerator. The 
following sentences are excerpts from the original US patent [1]. The left top 
picture is “a diagram showing a normal cylindrical wave guide and the axial 
electric field distribution therein." The left bottom picture is “a diagram 
showing a wave guide and the electric fields when a series of graded drift 
tubes are placed therein." The right top picture is “a diagram representing 
the voltage existing across the gaps of the drift tubes." The left picture of 
the right bottom corner is “a diagrammatic longitudinal sectional view of 
drift tube ends with the electric field distribution existing across the gap." 
The right picture of the right bottom corner is “a diagrammatic longitudinal 
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FIGURE 1.13: Sketches illustrating the basic principles of the linear ac- 
celerator of the Alvarez type. (From L. W. Alvarez, US Patent 2,545,595, 
1947 [1].) 


FIGURE 1.14: The structure of the RFQ, the radio-frequency quadrupole 
linear accelerator. The picture shows an early example at Los Alamos Na- 
tional Laboratory, New Mexico, USA. (From K. R. Crandall, et. al., in R. L. 
Witkover, ed., Proc. 1979 Linac Conf., BNL-51134, 1979 [17]. Courtesy 
Brookhaven National Laboratory.) 


section view of drift tube ends with a focusing foil attached to one tube, and 
the resulting electric field distribution.” For details regarding phase stability 
(the right top picture), see Section 10.2. For details regarding transverse 
focusing and defocusing, see Section 10.4. 

An interesting combination of the need for bunching, accelerating and 
focusing (which is discussed later in detail) is the radio frequency quadrupole 
(RFQ) accelerator. Developed in the late 1960s in the former Soviet Union, 
RFQs have been widely adopted as injectors for proton and ion accelerators. 
Fig. 1.14 shows the structure of a RFQ accelerator. The four vanes break 
the rotational symmetry and produce an electrostatic quadrupole field that 
oscillates with time. The traveling particles feel the quadrupole field that 
changes polarity with time and are focused in both transverse planes. The 
longitudinal electric field that accelerates the charged particles is produced 
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through modulation of the vanes. Similar to the drift tubes, the particles are 
accelerated throughout the structure when the distance between the peak and 
the neighboring valley satisfies 


Li = m 

In general, linacs can provide beams of high current, and of higher en- 
ergies than static accelerators, yet because of the single use of each electric 
field, they are still rather expensive per MeV. Linacs are frequently used as 
pre-accelerators for accelerators of higher energies. They also have the dis- 
tinctive advantage that they avoid synchrotron radiation, which is often 
a limiting factor in circular accelerators for light particles. This aspect is 
very important for electron and positron high energy accelerators such as the 
Stanford Linear Collider (SLC) at SLAC National Accelerator Laboratory, 
California, USA. It is the main reason for the interest in next generation 
Linear Colliders, such as plans being considered for an International Linear 
Collider (ILC), where a pair of two linacs shoot electrons and positrons at 
each other at very high energy. 

Recently, linear accelerators have been widely used in producing a free 
electron laser (FEL), whose high peak brightness and short pulse duration 
has opened up unprecedented opportunities for scientific investigations. Fig. 
1.15 shows the setup of the first FEL experiment. Electrons go through a 
magnetic device called the undulator, which consists of alternating magnetic 
poles. As a result, the trajectory of such an electron is very similar to a sine 
function, causing the emitted photon field to add coherently. Together with 
the large number of periods, the peak intensity of the X-ray can be orders 
of magnitude higher than that from a circular accelerator (see the following 
subsection). The advantage of a linac is that it can produce an electron beam 
with smaller emittance and shorter pulse duration. 


1.3.3 Circular Accelerators 


Arguably the simplest circular accelerator is the betatron, which, besides 
its practical use as a compact accelerator for lower energies, also represents 
an excellent textbook style application of principles of electrodynamics. In 
the case of the betatron, the orbit follows a circular shape, which is achieved 
by a magnetic field. If the motion is perpendicular to the magnetic field, then 
we have in SI units 


mv? mu p 


—— = quB, andso p= — = —, 

p qB qB 
and so the radius of motion depends only on the momentum and charge of 
the particle as well as the magnetic field. Note that the equation is correct 
even in the relativistic case, if m is understood to mean the relativistic mass 
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FIGURE 1.15: Sketch of the first Free Electron Laser (FEL). (Reprinted 
with permission from J. M. J. Madey, J. Appl. Phys., 42:1906, 1971 [47]. 
Copyright 1971, AIP Publishing LLC.) 


m = «y: mo. Commonly the ratio of momentum and charge p/q is denoted by 
Xm and called magnetic rigidity; we apparently have 


p 
Xm = — — Bp. 
q 


Because x4, = Bp, the magnetic rigidity has the unit Tesla meter (Tm), and 
is frequently simply referred to as B rho. 

In the case of the betatron, both bending and acceleration come from the 
same source, namely a magnetic field the strength of which increases with 
time in such a way that its magnitude matches the increasing energy of the 
particles to keep them at nearly constant radius, and the circular induced 
electric field provides the acceleration for the particles. Fig. 1.16 shows a 
sketch of the betatron [37, 38]. 

It is worthwhile to note that the basic idea of utilizing an electric field 
produced by a changing magnetic field also occurs in an application from 
daily life: certain modern cooking surfaces. In this case, the electrons that 
are accelerated are not within the vacuum of a beam pipe, but merely in the 
metal that constitutes the bottom of the pot used for cooking; and of course 
since their mean free path is short, they do not attain high energies before 
colliding with either other electrons or the lattice atoms, thus transferring 
their whole kinetic energy to heat. 

A quantitative understanding begins with Faraday's law of induction, now 
one of Maxwell's equations: 
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19% 


FIGURE 1.16: Illustration of the magnet of a betatron, from the original 
first paper on the subject. (Reprinted Fig. 2 with permission from [37] as 
follows: D. W. Kerst, Phys. Rev., 60, 47, 1941. Copyright (1941) by the 
American Physical Society.) 


and its integral form over a surface A with the bounding C is 


> aB 
Be nds. 
f. di — OE -ndS 


Using the flux of the magnetic field through the surface ® = f} B - ads, 


f Ear- 
C 


Here we restrict our interest to circular orbits with a radius r, and the sur- 
face A is the inside of the circle. Building the magnet rotationally symmetric 
entails a rotational symmetry of the fields, which simplifies the situation to 

1 dé 1 dB dB 
E = -—— = —-—rr— = a 
2mr dt 2nr dt 24d 
where B is the average magnetic field enclosed by the orbit. Thus, by denoting 
the strength of E; simply by E, 


_rdBl 
2 dt’ 


and below we denote |B| by B for simplicity. Thus we obtain for the momen- 
tum p — mv E » 
rdB B 
= = qE = => p 
q UD de rus p Oe 
On the other hand, it is necessary that the centrifugal force on the orbit with 
radius r is compensated by the Lorentz force at that radius, which requires 


mv? 


"x quB (r) > mv — qrB (r). 
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Thus, altogether we obtain the following relationship between the field B(r) 
at the orbit r and the average field: 


'This equation of central importance is often called the betatron condition. 
It requires a magnetic field that is stronger in the center than where the 
particles move, which can be achieved by suitably shaping the poles of the 
magnet. 

In principle the temporal behavior of B is irrelevant, and in practice one 
usually tries to ramp it quickly, because the pulsed beam is only available 
at the end of ramping. This is usually achieved by making the magnet part of 
an LC circuit (a resonant circuit, consisting of an inductor L and a capacitor 
C), which also conveniently allows the device to recover the energy stored in 
the magnetic field for the next ramping. For the practical use, it is important 
to try to limit Eddy currents in the iron of the magnets, and in order to 
maintain the condition B (r) = B/2, it is important to control saturation 
effects that may occur at any edges of the magnet. 

'The transverse confinement of the beam in the betatron is achieved through 
the inhomogeneity of the outer field, through effects that will be studied in 
subsequent chapters. The practical use of betatrons is nowadays mostly for 
electrons, where energies of about 300 MeV have been achieved; for protons, 
the values are about 50 MeV. 

Also in the microtron, which was invented by V. Veksler [69], a magnet 
is used to bend the particles to let them pass through the same source of 
electric field repeatedly. Different from the betatron, the emphasis here lies 
on the production of a continuous beam. Since this requires that the whole 
acceleration process must be independent of the specific time of injection, 
this entails that the magnetic field is constant in time. Thus an external 
voltage source is needed; as discussed above, if it is to be used repeatedly, 
it has to be a time dependent source, and in practice it is chosen to be an 
RF (radio frequency) cavity. Altogether, the motion follows a sequence of 
tangential circles of increasing radius that touch at the location of the RF 
cavity, as shown in Fig. 1.17. 

In order to synchronize the particle's motion and the momentary direction 
of the magnetic field, the revolution frequency of the RF cavity wo has to be 
a multiple of the particle's revolution frequency w, which can be obtained 
simply from 

2 
ymov? E. (1.9) 


T "ymmo 


quB >w 


This means it has to be either the motion is such that y = 1, which corresponds 
to non-relativistic motion and hence severely limits the energy, or just enough 
acceleration is provided in each turn that the revolution frequency decreases 
to the next multiple of the RF frequency. So the revolution frequencies would 
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FIGURE 1.17: Illustration of the first microtron. (From S. P. Kapitza, 
The Microtron, Harwood Academic, London, 1978 [35]. Fig. 1.9, p. 14. With 
permission: © Taylor & Francis.) 


follow the pattern 
Wo Wo Wo wo 

w = Og a ae gere (1.10) 
This entails that the factor y follows the sequence y = ^o, 270,370,440; . . ., 
which requires Ay = 1 per turn. Since E = mc? = ymoc?, this means 
AE = moc’, and thus the necessary energy gain per turn must equal the rest 
mass energy of the particle under consideration. For electrons, this means 
AE = 511 keV and is thus possible; for protons, AE = 938 MeV and this is 
not easily possible within the confines of a conventional magnet. 

A very important further development of the concept of a microtron is 
based on the fact that if the orbits of the particles are far enough separated 
so that one can apply different magnetic fields for each orbit and can even 
change the shape of the orbit away from circular, then by careful choice of the 
orbit lengths, it is possible to maintain the synchronicity condition (1.10) 
while maintaining the freedom to have any amount of acceleration that is 
convenient. This is the basic idea behind CEBAF, the Continuous Electron 
Beam Accelerating Facility, at Thomas Jefferson National Accelerator Facility 
(TJNAF, Jefferson Lab, JLab), Newport News, Virginia, USA. 
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FIGURE 1.18: The principle of the cyclotron, with top view on the left 
and side view on the right. (From E. O. Lawrence, US Patent 1,948,384, 1932 
[40].) 


The basic idea of the cyclotron is similar to that of the microtron, except 
that the RF cavity is used more efficiently by providing acceleration twice or 
even more times per turn, and the orbits roughly follow concentric circles. 
The concept of the cyclotron is shown schematically in Fig. 1.18 [40]. 

According to eq. (1.9), the revolution frequency is 


eic agr (1.11) 
"ymo 


and the momentary radius of the orbit is 


p 


t E (1.12) 


'This entails very similar restrictions regarding relativistic effects as in the case 
of the microtron; as before, any deviation from constancy of the magnetic 
field prevents continuous injection of the beam and hence leads to a non- 
continuous outgoing beam. But because the orbits are nearly concentric, it is 
possible to at least partly compensate the relativistic effects by increasing 
B radially in such a way that the revolution frequency in eq. (1.11) stays 
constant. This kind of cyclotrons is called the isochronous cyclotron. If it 
is necessary to accelerate different particles in the same machine, then that 
entails that the actual field profile has to be adjustable, which is usually 
achieved by having one or several trim coils. The superconducting K1200 
cyclotron at the National Superconducting Cyclotron Laboratory (NSCL) at 
Michigan State University, Michigan, USA, allows for such corrections of the 
profile of the magnetic field. 


Beams and Beam Physics 25 


FF.A.G. Model 


negative field magnet positive field magnet 


FIGURE 1.19: The first model of an FFAG, the Fixed-Field Alternating 
Gradient accelerator. (From L. W. Jones and K. M. Terwilliger, in E. Regen- 
streif, ed., Proc. CERN Symp. High Energy Accelerators and Pion Physics, 
CERN 56-25, 1956 [34]. Courtesy CERN.) 


If continuity of the beam is not of prime importance, it is possible to make 
the necessary relativistic corrections due to eq. (1.11) via a decrease of the 
RF frequency during the acceleration process, which is done in the case of the 
synchrocyclotron. This decrease obviously has to happen very quickly over 
the few hundred turns of the particles while staying within the accelerating 
structure, and thus the pulse frequency can still be rather high. 


A variant of the cyclotrons that were studied intensively in the 1950s was the 
fixed-field alternating gradient (FFAG) accelerator. Fig. 1.19 shows the 
drawing of the first of such an accelerator built. It combines the feature of the 
fixed magnetic field as in a cyclotron and the idea of alternating gradient 
focusing that became widely known in the early 1950s. Although FFAG did 
not flourish as a high energy accelerator, it has generated renewed interest in 
the past decade as a candidate to rapidly accelerate decaying particles such 
as muons and ion beams with large emittance and momentum spread. 


For any accelerator, the ultimate energy limitation comes from the 
strength of the magnetic field that is available as the unavoidable restric- 
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FIGURE 1.20: Sketch of the Bevatron, designed to achieve “Billions of eV 
Synchrotron," at Lawrence Berkeley National Laboratory, California, USA. 
(From E. J. Lofgren, in E. Regenstreif, ed., Proc. CERN Symp. High Energy 
Accelerators and Pion Physics, CERN 56-25, 1956 [46]. Courtesy CERN.) 


tion 

P Z Bp = Xm. (1.13) 

q 
The range of available magnetic fields is rather limited; typical numbers are 
in the range of 1-2 T for normal conducting dipole magnets, and several 
times more for superconducting dipole magnets. The superconducting dipole 
magnets at the Large Hadron Collider (LHC) at the European Organization 
for Nuclear Research (CERN), near Geneva, in Switzerland and France, op- 
erate reliably at 8 T. (See Table 1.1.) Looking beyond the rather stringent 
requirements for particle accelerators regarding field quality over extended 
regions and temporal stability, as of 2013 the highest magnetic fields that 
can be achieved are about 100 T. In fact, the National High Magnetic Field 
Laboratory (NHMFL), having branches at Florida State University, Univer- 
sity of Florida and Los Alamos National Laboratory (LANL), USA, reached 
100.75 T at the Los Alamos branch in 2012. The Dresden High Magnetic 
Field Laboratory (Hochfeld-Magnetlabor Dresden, HLD) at the Helmholtz- 
Zentrum Dresden-Rossendorf, Germany, reached 91.4 T in 2011, a record at 
the time, and 94.2 T in 2012. 

So for practical purposes, the only way to achieve high energies is to increase 
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FIGURE 1.21: Layout of the Cooler Synchrotron (COSY) ring at the Insti- 
tute of Nuclear Physics (IKP) at Forschungszentrum Jülich, Germany. (Cour- 
tesy Forschungszentrum Jülich GmbH.) 


the deflection radius p. This represents a significant practical limitation to 
continuous beam accelerators, in which B must be time independent and 
the size of the orbits increases in the acceleration process, since any region in 
which the beam may come has to be covered by magnetic fields. So for really 
high energies, the only realistic option is to have the particles follow the 
same orbit all the time by ramping the magnetic field during acceleration, 
and thus confine the region that has to be covered by the magnetic field. 


Of course this ongoing adjustment of the magnetic field during the accel- 
eration process according to eq. (1.13) to maintain constancy of p prevents 
continuous injection and hence continuous beams. Furthermore, since electric 
field strengths are comparatively more limited, the fields of the cavities have 
to be re-utilized many thousands of times, resulting in a rather stretched-out 
acceleration process, and thus a rather low repetition rate of beam pulses. 


All these thoughts lead to the concept of the synchrotron, in which the 
magnetic field strength is synchronized with the momentary energy or mo- 
mentum of the particle so as to maintain a constant location of the reference 
orbit. The first generation of synchrotrons uses inhomogeneous dipole magnet 
to bend and confine the beam transversely, which is essentially the same as in 
a betatron. The only difference is that the acceleration is achieved through 
RF cavities. Fig. 1.20 shows an example of such a machine. The main limit of 
this kind of synchrotron is that the transverse focusing force from the gradi- 
ent magnet is very weak, resulting in large beam pipes and magnets. For this 
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FIGURE 1.22: Layout of the Super-ACO light source storage ring at Labo- 
ratoire pour l'Utilisation du Rayonnement Electromagnétique, Orsay, France. 
(From M. P. Level, et. al., in Proc. PAC 1987, OSTI ID: 5125784, CONF- 
870302-Vol.1, 470, 1987 [44].) 


TABLE 1.1: Examples of hadron synchrotrons 


250 GeV 


reason, they are called weak focusing synchrotrons. Alternating gradient 
focusing offered orders of magnitude stronger focusing, much smaller beam 
size and much smaller magnets. Since the mid-1950s, alternating gradient 
synchrotron, also called strong focusing synchrotron, has replaced the 
weak focusing synchrotrons. Nowadays, almost all high energy accelerators 
are strong focusing synchrotrons. Fig. 1.21 shows one example, which is the 
layout of COSY, the COoler SYnchrotron at Forschungszentrum Jülich [3]. 
Table 1.1 shows characteristic features of some of hadron synchrotrons. 
Shown are the Relativistic Heavy Ion Collider (RHIC), at Brookhaven Na- 
tional Laboratory (BNL), Upton, New York, USA, the Tevatron (1987-2011) 
hosted at Fermi National Accelerator Laboratory (Fermilab, FNAL), Illinois, 
USA, and the LHC with their approximate dimensions, the maximum ener- 
gies (per nucleon for ions) for which they are designed, and particles to be 
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FIGURE 1.23: Layout of the Advanced Light Source (ALS) at Lawrence 
Berkeley National Laboratory, California, USA. (From Document Control 
Center of Lawrence Berkeley National Laboratory, Print number: 2202593, 
1989. Courtesy Lawrence Berkeley National Laboratory.) 


accelerated. 

The storage ring is not an accelerator in the traditional sense, since it 
holds the energy of the stored beam constant; however, it does not necessarily 
mean that RF cavities are not needed. In fact, due to synchrotron radiation, 
all the electron and the high energy proton storage rings use RF cavities to 
maintain the energy of the beam. Naturally, a synchrotron often can play the 
role of a storage ring as well. 

The time that the particles stay in the storage rings ranges from minutes 
to days. In the case of the Tevatron, where the circumference is 6.28 km, the 
time for one operation while having collisions for high energy experiments is 
about 8 hours, which is 28800 sec. So the particles circulate through the ring 


_ 3 x 105m/ sec -28800 sec 
i 6.28 x 109m 


As a comparison, the operation duration without collision is more than 100 
hours at the Tevatron. The LHC has similar numbers. Thus, even more so 
than in the case of the synchrotron, one of the main design problems and 
physically perhaps the greatest challenge is to try to ensure that particles 
actually stay contained over this large number of turns. Because the motion 
is nonlinear, this immediately leads to questions of nonlinear dynamics with 
all their complicated and interesting aspects. 

One of the applications of storage rings is the collider, where counter 
rotating beams are brought to collision at various points around the ring. At 


zz 10? turns. 
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very high energies, colliders have a significant energy advantage over fixed 
target machines because a very large fraction of the beams' energies can be 
converted to reaction energy. As a detailed study of the relativistic dynamics 
shows, this is not at all the case for fixed target cases; in fact, conservation 
of energy and momentum severely limits the energy that can be set free. 
Large scaled circular colliders are those listed in Table 1.1, and the tunnel 
with circumference 27 km, hosting the LHC currently, was earlier used for 
the Large Electron-Positron Collider (LEP) (e*,e^; ~ 100 GeV, 1989-2000). 
Besides the energy advantage, storage rings also have the disadvantage of the 
slow ramping times typical for synchrotrons; however, once the beam is stored, 
it is essentially continuous again. 

But also for situations that require the beam to hit a fixed target, storage 
rings often offer an advantage over the use of synchrotrons by themselves, 
because it is often possible to extract the beam much more slowly than in the 
case of the synchrotron, resulting in a more easily manageable duty cycle and 
reducing the problem of overflowing the electronics in the detectors. In this 
method of ultra-slow extraction, the nonlinear dynamics of the device is 
adjusted very carefully and gently, as over time a larger and larger part of the 
originally stored emittance becomes unstable. If it is possible to control the 
location around the ring where the spilling occurs, then the spilled particles 
can be directed toward the fixed target as needed. One storage ring where 
this approach is utilized is COSY, the cooler synchrotron and storage ring, at 
Forschungszentrum Jülich, Germany, shown in Fig. 1.21. 

Another application of the storage ring that has become one of the most 
productive tools for scientific research is the synchrotron light source. 
Although not as majestic as the giant high energy colliders, there are many 
synchrotron light sources throughout the world, and each facility hosts many 
users from almost all disciplines of the sciences. In the light source, the probe 
for the experiments is the light (from far infrared to hard X-ray) radiated 
by the electrons when the orbits are bent in the ring, which is generated 
through the process of synchrotron radiation. Figs. 1.22 [44] and 1.23 
[42] show a couple of synchrotron light sources. As the electron mass is so 
small, in principle, any bending magnet can be used to produce light due 
to synchrotron radiation. But in addition, in the straight sections of a light 
source ring, often wigglers and undulators, which consist of alternating 
short bending magnets, are placed to produce more intense and coherent light. 
In such a way, each light source ring can hold tens of light beamlines, much 
more than the number of interacting locations that a high energy physics or 
nuclear physics collider can have for the collider experiments. 


Chapter 2 


Linear Beam Optics 


In the discussion of the basic physical principles of the various types of ac- 
celerators, we casually neglected the fact that it is necessary to take care of 
more than one particle. In fact, all the above accelerators have to be able 
to simultaneously deal with an ensemble of particles with similar phase space 
coordinates, which is what the sources deliver, and hence with a beam. As 
outlined above, a detailed understanding of the motion of the beam requires 
the study of the motion of the reference particle as well as the motion of 
the relative coordinates. 


In the case of accelerators, our demands on the relative motion are mostly 
that the beam does not become unreasonably large, and hence that the motion 
is somehow bounded within a suitable volume of phase space. While this 
appears to be a modest wish for long single pass accelerators, and more so for 
repetitive systems, this problem actually turns out to be rather nontrivial. 


For other types of systems, more specific requirements have to be made for 
the beam. For example, to maximize the number of collisions at an inter- 
action region of a collider, it is important to “squeeze” together the spatial 
coordinates of the beam, which under conservation of phase space volume 
then requires the momentum coordinates becoming large. Devices like parti- 
cle spectrographs or electron microscopes have different and often even more 
involved requirements. 


In all of these cases, it is important to study the relative motion carefully. 
As a first step, the motion is linearized, and for higher precision, the nonlinear 
effects of the motion have to be studied. Because the volume in phase space 
occupied by a beam is small, these nonlinear effects are often treated in a 
perturbative way, in which the first order corresponds to linear motion, 
and nonlinear motion appears as higher order (see Table 2.1). 


TABLE 2.1: Classification of effect 


zeroth order motion of reference particle 
first order linear motion 
second+higher orders nonlinear motion 


DOI:10.1201/b12074-2 31 


32 An Introduction to Beam Physics 


Reference Orbit 


FIGURE 2.1: Reference orbit, arc length s along it and local coordinates. 


2.1 Coordinates and Maps 


Usually when studying dynamics, the time t plays the role of the indepen- 
dent variable, and we study the motion of positions 7 and velocities v or mo- 
menta p as coordinates. Using the Lagrange mechanism, it is easy to transfer 
to new coordinates, in particular the coordinates that describe the relative 
dynamics around the reference orbit. Furthermore, instead of using t, we 
usually use the arc length s along the reference orbit as an independent 
variable. Fig. 2.1 illustrates the concept. 

For the understanding of the motion in relative coordinates, let us assume 
we have studied and understood the motion of the reference orbit. In case 
there is no field at all, this reference orbit will merely follow a straight line. 
Furthermore, there are many devices used in accelerators that have fields, but 
along one given straight line, all the fields vanish, and the device is lined up 
in such a way that the reference particle follows this line. Another important 
device uses magnetic fields, and along the reference orbit one tries to hold 
the magnetic fields constant, in which case the reference orbit is circular, at 
least within the element. In all other cases, it is usually necessary to describe 
the reference orbit by numerically integrating the equations of motion. 

We assume the position and momenta of the reference particle Ty¢r(s), prer(s) 
are known. Here the momentum p is the dynamical momentum as in eq. 
(1.4). As a technical detail, let us also assume that for all points s, we have 
Dret(S) Y Eztab, i.e., the motion is never pointing vertically straight (which 
for most real accelerators is no limitation whatsoever). Let furthermore ftube 
be smaller than the minimum radius of curvature that the reference orbit 
experiences in the section of the device that we want to study. We now 
consider a “flexible tube" of radius rjype centered around the reference orbit, 
and restrict the particles that we want to describe to only those within the 
tube, as Fig. 2.2 illustrates the situation. Again, for practical devices this 
represents hardly a limitation; for example, in the LHC (see Table 1.1), the 
“tube” would be more than 2 km wide, much larger than the region required 
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FIGURE 2.2: Motion of particles inside the tube with radius riype around 
the reference orbit. 


by the beam particles. 

For any particle within the tube, there is now a closest point on the 
reference orbit; because only particles within the tube are allowed, this point 
is indeed unique. Let s be the arc length at this point, and r-e¢(s) the position 
of the reference particle on the reference orbit. Then the relative coordinates 
of the point 7 are obviously F — 7yer(s). 

Let now é, be a unit vector in the direction of Dref. Consider now the plane 
perpendicular to é;. Of all the unit vectors in this plane, let ëy be the one with 
the largest vertically upward component; because in our setup Pret and hence 
€, are not allowed to go vertically straight, this vector is well defined. Finally 
choose a third vector &; as € = Ey x €. Because €, has a maximum vertically 
upward component, €; has a vanishing vertical component and hence lies in 
the horizontal plane. 

Denote now by “x” the component of 7 — r;;r(s) in the direction of €, and 
by “y” the component of  — 7-e¢(s) in the direction of €}. Similarly, define p; 
and p, to be the momentum components of p — Pref in the directions €; and 
ey: 
Using (2, Pr, Y, py], the motion in the transversal plane, defined by & and 
€y, can be described, and it is called the transversal dynamics. However, 
considering how a beam is formed as we have seen in Chapter 1, we have to 
consider that the energy of a particle E in the beam can be different from 
that of the reference particle Eyer, even if it is only slightly so. The energy 
difference of the particles as well as the geometry of the orbits also results in 
the difference of the travel time t of the particles, called the time-of-flight. 
Thus, the energy and the time-of-flight have to be considered when studying 
the motion of a beam, and it is called the longitudinal dynamics. 

As we will see later, the transversal motion and the longitudinal motion are 
in general coupled, except for special cases. Altogether, we will describe the 
motion of the beam in six coordinates [z, pz, y, py, E, t). The actual choice of 
coordinate quantities requires a careful consideration, as it eventually deter- 
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mines how widely the resulting derivations are compatible with general con- 
cepts in physics and mathematics. Below, the quantities with the subscript 0 
are meant to indicate the reference particle. 

Before launching the motion, we denote the energy deviation of a particular 
particle of interest by 6 by defining 


where K is the initial kinetic energy of the particle under consideration while 
Ko is that of the reference particle. Finally, we introduce a space-like variable 
l 

l=kK (t E to) 


being the deviation of the time-of-flight t from that of the reference particle, 
multiplied by a constant « that has the dimension of velocity. Specifically, 


Yo 
1+ 0’ 


K = —v9 (2.1) 
using the absolute value of the velocity of the reference particle vy and the 
associated yo, which can be expressed as 


1 (poc)? + (me)? _ Eo 


qU — S ACER Ape 
1— vec? mc? mc?" 


by referring to eq. (1.6). The specific form of «, especially the fractional factor 
involving yo, is important for generating what turns out to be a canonical pair 
of coordinates (l, ô); the details go beyond the scope of this book, and we refer 
to [5] for details. 

Then we form the vector Z of particle optical coordinates as 


p 
a = ps / po 
Z=| 4 (2.2) 
b =py/Po , 
l = k(t — to) 
ô = (K E Ko) /Ko 


where po is some previously chosen scaling momentum; a natural choice is 
to select the momentum of the reference particle at the beginning. Likewise, 
Ko is a previously chosen scaling energy, for example the kinetic energy of 
the reference particle, and similarly, « is a scaling quantity introduced in eq. 
(2.1). 

Note that due to the definition of Z , the reference particle itself corresponds 
to Z = 0, and hence the vector Z does indeed describe the relative motion. 
In a seemingly simple way, most of the problems of beam physics now revolve 
around the question as to how Z evolves as a function of s. 
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In light of this, the entire action of a beam physics device can now be 
expressed by how it manipulates the coordinates in Z.In fact, usually a set of 
initial conditions Zp at position sg uniquely determines the future evolution 
and hence Z at any later position s. While a common notion, mathematically 
this determinism of classical mechanics rests on some subtle assumptions 
about the details of the fields that are allowed in the motion; but further 
details are beyond this book. 

Assuming that indeed Žo at so uniquely determines the future evolution, 
we can define a function relating the initial conditions at sọ to the conditions 
at s via 


> 


Ž (s) = M (s, 80) (Z(s9)). 


The function M (so, s), which formally summarizes the entire action of the sys- 
tem, is of great importance for the description and analysis of beam physics 
systems. It is often called the transfer function, the transfer map, or 
simply the map of the system. Note that the transfer maps satisfy the rela- 
tionship 


M (82,81) o M (s1, 80) = M (s2, $0) , (2.3) 


which merely says that transfer maps of systems can be built up from the 
transfer maps of the pieces. 
Since M describes the motion in relative coordinates, we always have 


M(0) = 0. 


Furthermore, since by the very definition of a beam, the coordinates of Z are 
“small,” M is usually only weakly nonlinear. Because of this, its deter- 
mination and analysis is very amenable to perturbative techniques. The 
first step in this process is to consider only the linearization M of M, the 
so-called linear map. Let A = M-M be the remaining purely nonlinear 
part, so that we have 


M -M +N. 


The linear map M is simultaneously the most important and the easiest to 
study. The treatment of the nonlinear part V is much more complicated, and 
only later in the book will we address a small part of the problems associated 
with its treatment. More details can be found, for example, in [5]. 

In the following section, we will make a short excursion to a field that at 
first glance appears disconnected from beam physics, namely the field of glass 
optics. However, a closer look shows that glass optics, which has existed long 
before the name beam physics was introduced, certainly belongs to this field: 
the ensembles of light particles or rays typically associated with questions of 
glass optics form a beam not only in the conventional meaning of the word, 
but also under the more formal definition. 
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2.2 Glass Optics 


As one may recall from a basic course in optics, a distinction is made be- 
tween so-called “Gaussian optics," which indeed turns out to just mean 
linear motion, and “aberrations” that describe nonlinear effects. Optics has 
developed its very own jargons and techniques, some of which are connected 
to complicated geometric ideas, and in our opinion it is historically unfortu- 
nate that optics has not been treated with the methods of the transfer map. 
We shall remedy this situation here by simultaneously providing a short intro- 
duction on Gaussian optics in an appealing and unified way, and also develop 
our skills in dealing with linear maps. 

For simplicity, let us restrict ourselves to systems that are rotationally sym- 
metric, like most glass optical systems; it will be quite clear as we go what has 
to be done to treat non-rotationally symmetric systems. In this rotationally 
symmetric case, two variables are enough to study the motion; we here choose 
them as the position x and the slope a of a ray. The transfer map of an optical 
system then expresses how (x,a) behave as they transfer a system, and we 


have 
[o e d fo 
ag ay 


In fact, if we restrict ourselves to linear motion, then this can be expressed in 
terms of a transfer matrix 


E Ge (ola) 
(ala) (ala) 
Note that the notation for the matrix elements is such that the quantity 
before the vertical line “|” describes the row, and that after the vertical 
line describes the column. We remind again that knowing matrices of pieces 
allows the computation of matrices of more complicated systems, which is 
here achieved by mere matrix multiplication. Indeed, if Ma through My, are 


the matrices for the subsystems, then because of the associativity of matrix 
multiplication, we obtain for the ray after the last subsystem: 


Tn+1 — y n y vi e = y eM 1 
toe pete een 

So we have shown that the matrix of a combined system equals to product 
of matrices of subsystems. Since especially on computers it is very simple 
to multiply matrices, this is the method of choice for the basic design of 


optical systems. In the following, we hence derive the forms of the matrices 
of common optical elements. 
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FIGURE 2.3: A ray passing through a drift. 


2.2.1 The Drift 


'The simplest part of glass optical elements is a region which does not contain 
any material, the drift. The final position and slope x2 and a» after a drift 
of length l can be connected very simply to the initial values x; and aj, as 
shown in Fig. 2.3 


z2 = 11 a,-l, Q5 = Q1. 


'This obviously can be written in a matrix form as 
T2 = 1 l X1 
ag i 0 1 1 s 
For the later discussion it is important to note that the matrix (4 | ) depends 


only on the characteristic properties of the element, which here is the length 
lL. On the other hand, the vector (z1,a1) depends only on the parameters of 
the ray. Altogether, a drift performs a linear transformation in x, a space. 
Note that the determinant of the drift matrix is unity. 

As a small exercise, let us now consider a combination of two drifts of lengths 
lı and l2. For the value of the coordinates (x3,a3) after the combination of 
the two drifts, we have 


ol) = na) Gaeta tn ca) 


Here the necessary composition of maps just reduces to a common multi- 
plication of transfer matrices. And the result is not surprising, the effect 
of two subsequent drifts is just the same as that of a drift of the combined 
length. 


2.2.2 The Thin Lens 


Besides empty space, glass optical devices contain lenses that change the 
direction of the light ray. Here we are primarily interested in the thin lens, a 
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FIGURE 2.4: A bundle of parallel rays passing through a focusing lens. 


somewhat idealized device without any length, which is characterized by the 
following facts that are also illustrated in Fig. 2.4. 


1. Positions are not changed, but directions are changed. 


2. Any bundle of parallel light is unified in one point a distance f after the 
lens. 


3. A ray lighting the center of the lens goes straight through. 


The quantity f that describes the lens is called the focal length. Let us 
now consider a ray passing through the lens. From Fig. 2.4 we find 


t= £1, p= fai, zı +a2: f =p, 
from which we infer à» 
1 
T=, A= E + 04. 


'This relationship can be written in a matrix form as 
X2 u 1 0 X1 
(re) = Cur 1G) ea 
As in the case of the drift, the matrix Cus a) depends only on the focal 
length f, the characteristic property of the lens, whereas the vector (21, a1) 


depends on the ray. Note that the determinant of the matrix (. Jf 2) is 


unity. 

The simple thin lens we have discussed here, the so-called focusing Gaussian 
lens, represents quite an approximation for several reasons. First, any real lens 
performs a refraction at two different surfaces, so positions do change as one 
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FIGURE 2.5: A bundle of parallel rays passing through a defocusing lens. 


goes through the lens. Furthermore, for most lenses it is not really true that 
parallel rays all meet at a point a distance f behind the lens. This is connected 
to the fact that lenses are usually ground with spherical surfaces because 
anything else is technically difficult. Furthermore, the glass has dispersion, 
so different colors are affected differently. We note however that Snell's law 
still allows us to determine the true transfer map of a thick, spherical lens 
in a rather straightforward way. It is important to note, however, that this 
transfer map will no longer be linear. 

Quite interesting is the combination of two glass lenses, which can ap- 
parently be described by multiplying their matrices. Note that, always, the 
matrix of the first element is on the right. We obtain 


m au D) aiibi a 


So the combination of two lenses provides the same effect as one lens with 
focus length f, where 1/f = 1/ f; + 1/ f». This is of course a famous law of 
optics, the derivation of which is all but trivial in the matrix context. Indeed 
the efficiency of the matrix approach becomes clear when observing how to 
prove this law using the standard geometric method of optics textbooks. 

In a similar way as the focusing thin lens we can also treat the defocusing 
thin lens. In this case, the basic properties can be found as illustrated in Fig. 
2.5. 


1. Positions are not changed, but directions are changed. 
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2. Any bundle of parallel light exits the lens in such a way that it appears 
to come from a point a distance f in front of the lens. 


3. A ray lighting the center of the lens goes straight through. 


In a similar way as before, we can use basic geometry to determine the 
action of the lens. From Fig. 2.5, we find 


X2 = T1, p=-f:a, p= z2 — f: az. 


Similar to before, we obtain ag = z1/f + a1, which is in a matrix form 


X2 nS 1 0 Tı 

ag E 1/f 1 ay i 
This is essentially the same matrix as before, except that now the sign of 
the matrix element (a|x) has changed. Indeed, using the standard convention 


to count defocusing lenses with a negative focal length, the matrix has even 
exactly the same form as before. 


2.2.3 The Thin Mirror 


Besides lenses, mirrors are probably the second most important optical 
device, and there are also focusing and defocusing mirrors. Different from 
the lens, the reference orbit flips direction when hitting the mirror. A thin 
focusing mirror is defined by what it does to an ensemble of parallel light via 
three conditions, illustrated in Fig. 2.6. 


1. Positions are not changed, but directions are changed. 


2. Any bundle of parallel light that is reflected by the mirror will meet in 
a point a distance f in front of the mirror. 


3. A ray hitting the center of the mirror is reflected such that its outgoing 
angle equals its incoming angle. 


A similar argument as in the case of the focusing lens shows that the transfer 
matrix of the focusing mirror is 


ne ae 


There is also a defocusing mirror, defined by three conditions: 


1. Positions are not changed, but directions are changed. 


2. Any bundle of parallel light that is reflected by the mirror seems to 
emerge from a point a distance f behind the mirror. 
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mirror 


FIGURE 2.6: A bundle of parallel rays is reflected by the focusing mirror. 


3. A ray hitting the center of the mirror is reflected such that its outgoing 
angle equals its incoming angle. 


A similar argument shows that also in this case, we have the transfer matrix 


m= (uy 1) 


where the convention to count the focal length f of a defocusing element 
negative is used. 

So apparently mathematically, lenses and mirrors behave the same, aside 
from the fact that they reverse the reference orbit. The choice of which to use 
in practice depends on a variety of practical factors. For situations requiring 
only small apertures like in most camera lenses, glass lenses are easily made, 
and have an advantage because of the straight beam path. For situations 
requiring large apertures, like in big telescopes, mirrors are the primary choice 
because it is much easier to manufacture and support large mirrors than large 
lenses. It is also easier to produce non-spherical shapes for mirrors than for 
lenses. Finally, mirrors have the additional advantage that they treat light of 
different colors equally; they do not show the dispersion commonly observed 
in glass lenses. 


2.2.4  Liouville's Theorem for Glass Optics 


As a direct consequence of the matrix notation for glass optics introduced 
above, for any combination of lenses, drifts and mirrors, we can prove a special 
case of Liouville's theorem: The volume of phase space occupied by the beam 
is conserved. 

Indeed, let us assume that we have an optical system consisting of n ele- 
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FIGURE 2.7: Liouville’s theorem. The volume of phase space occupied by 
the beam is conserved. 


ments with matrices M. 'Then we have 


(ea) = Me e CR) )) Qe 9) (2) 


The determinants of each of the matrices M; are just unity, as they are all 
either drifts, lenses or mirrors, so the determinant of the product is unity. 
Under linear transformations, volumes in space transform with the size of the 
determinant, thus the volume is indeed conserved. Fig. 2.7 illustrates this 
situation. 

An interesting and remarkable consequence of Liouville's theorem is the 
famous recurrence theorem of Poincaré. Let us assume we have some 
motion in n-dimensional phase space and also that we know that the motion 
is bounded in all phase space variables. Let us further assume that the mo- 
tion obeys Liouville's theorem, which as we shall see later is the case for all 
Hamiltonian systems, and let the motion be deterministic. Then Poincaré's 
recurrence theorem states that for any given e, the system after sufficient time 
comes back to its original state within a tolerance of at most e. 

Before we sketch the proof of Poincaré's theorem, let us illustrate some of 
its consequences. Consider for example a box with classical gas particles that 
are initially all located in one side of the box and kept there by a wall as 
shown in Fig. 2.8. After the wall is removed, the gas particles will distribute 
in the box evenly, as we expect from classical statistical mechanics, increasing 
their entropy. But their phase space is bounded, as the particles cannot leave 
the box, and each particle's momentum is limited by the total heat energy 
contained in the box. 

So as time progresses, according to Poincaré, they will at one time in the 
future just recollect on one side of the box, and by re-inserting the wall, 
they will be caught again on one side, in crass contradiction to the entropy 
principle. 

There are many other examples. If we have a particle beam in an accelerator 
that we know is stable, it will eventually come back as close as we want in 
phase space, which is an effect that is actually observed somewhat routinely in 
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later eventually 


> => 


FIGURE 2.8: Poincaré’s recurrence theorem. After sufficient time, the 
system returns close to its original state. 


tracking pictures. Even for our daily life, there are important consequences. 
If the universe is Hamiltonian and it does not expand indefinitely, then up to 
minute details, history will keep repeating itself. So we will all be born again, 
and we will all make the same mistakes all over, but since now we cannot 
remember anything about our past life, also next time we will not remember 
our current life. 


Now let us sketch the proof of the recurrence theorem. Let an & be given, 
and consider an &-ball with volume V; in phase space. Consider its motion by 
regular time steps At. Since the total available phase space volume is finite, 
say Vp, after at most V,/V- time steps, the image of the ball must reach a 
part of phase space it has touched before, i.e., it must overlap a previous 
image of the ball. Let us assume this happens after N steps and the previous 
image is that after n steps, with n < N. But if the images after n steps In 
and after N steps Iw overlap, so must the images after (n — 1) and (N — 1) 
steps, respectively. And continuing backwards, so must the images after 0 and 
(N — n) steps; hence, after (N — n) steps, we touch the original ¢-ball again. 


2.3 Special Optical Systems 


In this section we want to apply the matrix techniques to the study of 
certain special categories of systems. In particular, we associate certain fun- 
damental properties of systems with properties of the matrix. We begin with 
the imaging systems. 


44 An Introduction to Beam Physics 


FIGURE 2.9: Sketch of an imaging system. 


2.3.1 Imaging (Point-to-Point, e e) Systems 


Imaging systems or point-to-point systems are perhaps the most important 
systems in optics, and they deserve some special attention. Suppose we study 
the action of a slide projector. At one end of the projector, light is sent 
through the slide. Suppose the slide shows a man wearing a gold earring. 
The image of this man is to appear on the screen, and the gold earring is to 
appear at one particular location. This requires that all light passing through 
the golden spot on the slide and emanating in various directions has to be 
re-united at one spot on the screen, as shown in Fig. 2.9. 

This means that the final position of a ray is independent of its initial angle 
and it only depends on the initial position. In terms of transfer matrices 


; (z|x) vid 
M = 
Cis (aa) 
this means that the element (r|a) has to vanish: 
(x|a) = 0. 


Obviously the element (x|x) also has an important interpretation: it is the 
magnification of the system. 


(a|xz): magnification. 


Besides the case of the slide projector, many other devices use imaging. 
They include the camera, the overhead projector, the eye, the photographic 
microscope, the electron microscope, as well as particle spectrographs. We 
will discuss in detail some such devices in Chapter 7. 

It is worthwhile to study how imaging systems can be made. First, we 
observe that a drift is imaging if and only if | = 0, while it is a rather boring 
choice. A single lens is also always imaging as long as there are no drifts 
before and after, but that is another boring choice. The first interesting 
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imaging system is the DLD (drift-lens-drift) system, consisting of a drift, a 
lens and another drift. The transfer matrix of the DLD system is given by 


s-( Gs DG = E) Cu iu) 


E l—-l/f h+l-—ll2/f 
(X -1/f 1- h/f l 


If such a system is supposed to be imaging, we have to satisfy (a|a) = l + 
l2 — lıl2/ f = 0, which is equivalent to 
1 1 1 


o lk f 
This is another important result of conventional optics, which here is obtained 


in an almost trivial way. If the DLD system is made to be imaging, the 
magnification is given by 


This principle is used in several different devices. In the slide projector, 
lı is very small and lə is very large, thus it provides a large magnification. 
Probably the most important imaging system is the eye. Here the situation 
is just the opposite. lı is large and l2 is small, allowing for large things to be 
mapped on the small retina of the eye. 

It is interesting to study the combination of two imaging systems: 


(z|x)2 0 (xj) 0 \ (aja )o (|) 0 
(alx)2 (a|a)a /A (a|z)s (aja) (a|x)2(a|x)1+(ala)o(ale)1 (aļa)2(aļa)ı / 
(2.5) 
As is to be expected, the total system is again imaging, and the magnifica- 
tion is (z|z)2(x|x)1, just the product of the individual magnifications. 


2.3.2 Parallel—-to—Point (|| e) Systems 


As we saw above, the human eye observing a nearby object is one of the 
prime examples of an imaging system. But what happens if the eye looks at 
things farther and farther away, in particular at the stars, a pastime of the 
human race and scientists for eternity? The length of the first drift |; becomes 
larger and larger, and for all practical purposes the light coming from one star 
reaches the eye as a parallel bundle. So what the eye is to interpret now is 
the angle under which the light comes in, and hence the position on the retina 
should depend only on the initial angle at which the light strikes the eye, but 
not on the initial position. 
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FIGURE 2.10: Sketch of a parallel-to-point system. 


FIGURE 2.11: Sketch of a point-to-parallel system. 


This is an example of parallel-to-point systems as illustrated in Fig. 2.10. 
A parallel-to-point system requires that 


(a|x) — 0. 
If we look at the eye as a DLD system, this requires 


(z|z) -1—15/f —0 la — f, 


while lı is arbitrary. Thus the retina has to be exactly at the focal length; 
almost as important is that the distance to the object is arbitrary since we 
cannot change our distance to the stars significantly. Another important 
parallel-to-point system is the photographic camera. 


2.3.3 Point—to-Parallel (e ||) Systems 


Another important class of systems is the point-to-parallel systems. In 
these systems, the final slope depends only on the initial position, but not on 
the initial slope as illustrated in Fig. 2.11. So we have 


(ala) = 0. 


Examples include the flashlight, the microscope, and laser and particle beam 
transports over long distances such as those considered for the SDI Transport 
(Strategic Defense Initiative) considered by the US government in the 1980s. 

As an example, let us try to achieve a point-to-parallel system with a DLD 
combination. We obtain 


(ala) 2 1— h/f =0 ly — f, 
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FIGURE 2.12: Sketch of a parallel-to-parallel system. 


as we may have expected. Note that there is no condition on l5. 

From the transfer matrices, it follows rather directly that the combination 
of a point-to-parallel and a parallel-to-point system forms a point-to-point 
system. 


(ls (as) Fac 0) = (ratae latae Galatea) 


Using the relaxed eye as the parallel-to-point system, we can thus build a 
microscope by placing a suitable point-to-parallel system in front of the eye. 
It is interesting to see how the lengths in a point-to-parallel system have to 
be chosen; by requiring (a|a) = 0, we obtain lı = f, while lz is arbitrary. The 
first part is as expected; the latter part is fairly important for the operation of 
a microscope because it allows the eye to move with respect to the microscope. 


2.3.4  Parallel-to-Parallel (|| ||) Systems 


The final important system is the parallel-to-parallel system illustrated in 
Fig. 2.12. By placing it between the eye and the stars, a magnification of 
angles can be achieved. This is the principle of the telescope. 

The system has to be such that the final slope depends on the initial slope, 
but not on the initial position, which requires 


(a|z) = 0. 
The magnification is given by (ala). 
(ala): magnification. 
If we try to achieve this with a DLD system, then we have to satisfy (aja) = 
—1/f to be 0, which is impossible. This entails that a telescope has to contain 


at least two lenses. 
So let us consider an LDL (lens-drift-lens) system. 


He er 3t 3€ = Cus AL URE un) 
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And we have to satisfy 


1 1 " Don 
fo h dafs 
which is a well-known condition for Newtonian or Galilean telescopes. The 
magnification of the telescope is given by 

l 1 

aja) =1- — =1- — 

(a|a) F 7 

Thus, it requires fı >> f2 to obtain large magnification. Since there is a limit 

on how short f2 can be, it is thus necessary to make fı large, which entails 
what the rather large size telescopes usually have. 


(alz) = 0 => [= fi + fe; 


2.3.5 Combination Systems 


Often the question arises to what extent it is possible to simultaneously 
satisfy the requirements for the above systems. To some extent this is possible, 
but the fact that the determinant of the total system has to be unity due to 
Liouville’s theorem for glass optics imposes some restrictions. 

A closer look shows that 


1. e e and || || is possible: (z|a) = (a|z) = 0. 


2. || e and e || is possible: (z|z) = (ala) = 0. 


All other cases are impossible because they would require a zero deter- 
minant. 


Another important question is what happens when two systems satisfying 
certain properties are combined into one system; for example, we already saw 
in eq. (2.5) that two point-to-point systems placed behind each other again 
produce a point-to-point system. A more detailed analysis shows that of 
the sixteen cases describing combinations of two systems, eight cases lead to 
another special system 


eoteo=ee, | +i =] I, 
eecel-el. Il-le-le. 
e|-|e-ee, |e-ce|-|l; 
e+iil=ell, lletee=le. 


The entries in the table above are easy to memorize because it contains just 
those combinations for which the second symbol of the first system equals the 
first symbol of the second system, and the final result is obtained by dropping 
the two identical symbols. So in compact notation, we have: 


If A,B,C € {e, ||}, then AB+BC=AC. 


Chapter 3 


Fields, Potentials and Equations of 
Motion 


For the study of transfer maps of particle optical systems, first it is necessary 
to undertake a classification of the possible fields that can occur. All fields 
are governed by Maxwell’s equations, which in SI units have the form 


= acr OD) 
div B = 0, owl =j P, 
> 4 B 
div D =p, curlk = a (3.1) 


In the case of particle optics, we are mostly interested in cases in which 
there are no sources of the fields in the region where the beam is located, so 
in this region we have p = 0 and j = 0. Of course any beam that is present 
would represent a p and a j but these effects are usually considered separately. 

In the following, we want to restrict ourselves to time independent sit- 
uations, and neglect the treatment of elements with quickly varying fields 
including cavities. This limitation in very good approximation also includes 
slowly time varying fields like the magnetic fields that are increased during 
the ramping of a synchrotron. 

So, Maxwell's equations simplify to 


div B — 0, curl H = 0. 
div D — 0, curl E — 6 


(3.2) 


where " - 

B= koH, D = €o E. 
Because of the vanishing curl, we infer that E and B have scalar potentials 
Vg and Vg such that 


E=-VVe, B=—VVz. 


Note that here even the magnetic field is described by a scalar potential, 
and not by the vector potential A that always exists. From the first and third 
equations of (3.2), we infer that both scalar potentials Vg and Vg satisfy 
Laplace's equation, and we thus have 


AVg 20, AVg-0. 
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In order to study the solutions of Laplace’s equations for the electric and 
magnetic scalar potentials, we will proceed for two special cases, each of which 
will be treated in a coordinate system most suitable for the problem. 


3.1 Fields with Straight Reference Orbit 


The first major case of systems is those that have a straight reference orbit. 
In this case, there is no need to distinguish between particle optical coordinates 
and Cartesian coordinates, and in particular there is no need to transform 
Laplace’s equation to a new set of coordinates. Many elements with a straight 
reference orbit possess a certain rotational symmetry around the axis of 
the reference orbit, and it is most advantageous to describe the potential in 
cylindrical coordinates with a longitudinal z-axis that coincides with the 
reference orbit. 


3.1.1 Expansion in Cylindrical Coordinates 


We first begin by expanding the r and ¢ components of the potential in Tay- 
lor and Fourier series, respectively. However, the dependence on the cylindri- 
cal “z” coordinate, which here coincides with the particle optical coordinate 
s, is not expanded. So we have 


V =V(r,6,8) = M Mya (s) cos (lo + 6,1) r*. (3.3) 
k=0 1=0 
In cylindrical coordinates, the Laplacian has the form 


18 OV 10V PV 
V(r, 0, 8) = -=> 2-10 5) d 2 99 s 


19 (OV) , Lev OV 
i ar) rg 2 


We insert the Fourier- Taylor expansion of the potential (3.3) into each term 
of the Laplacian. 


,9V = 25 5 Mx1(s) cos (lo + Oka) kr*-1 


M Mas) ) cos (lb + 0,1) kr”, 
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then the first term is 


OV quor m 
r Or (Z) ES S Mas) cos (l + 84,1) K? r^? 


=11=0 


where we let the sum start at k = 0 in the last step, since there is no contri- 
bution anyway because of the factor k?. The second term is 


10?V jo 2k 
229g y »- y Mp (5s) cos (ld + 0k) l^r 
i T ke 120 
AE V Mpls) cos (1d + 0.1) 2r*-?. 
k=0 1=0 
The third term is 
82V oO oo 
Os? - MM Mis ) cos (lo + 0.1) r^ 
k=0 l=0 
= Y, » My. " (s) cos (ld + 02.1) pko2 
k=2 1-0 
= 3 s Mg 3 (s) cos (ld + 0,24) r7, 
k=0 1-0 


where we let the sum start at k — 2 in the second step, and, further, in the last 
step, we used the convention that the coefficient Mp (s) vanish for negative 
indices. Recognizing that all the terms have the common summations and 
the factor r*-?, we obtain the Laplacian for the Fourier-Taylor expansion of 
the potential (3.3) as 


AV = 5 [Mkr (s) cos (164-05,1) (K? — 1°) + Mg. ai (s) cos (lġ+0r-2,1)] pho? 
k, 1=0 


To satisfy Laplace’s equation, we obtain a set of conditions for k,l > 0, 
Mxya(s) cos (lo + Ok) (k? = I?) + Mg 21(s) cos (ld + 0x24) = 0, 


where the second term vanishes for k = 0, 1 because of the negative indices 
for Mx. 

We begin the analysis of Laplace's equation by studying the case k — 0, 
where only the first term matters. Apparently Mo, and o,o can be chosen 
freely because k? — |? = 0 for k = l = 0. For l > 1, we infer Mo, = 0. 
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By induction over k, we now show that Mp ı = 0 for all cases where k < l. 
Apparently the statement is true for k — 0 as we just showed. Now let us 
assume that the statement is true up to k— 1. If k < l, also k—2 < l, and thus 
My ,,(s) = 0. Since k? — 1? 4 0 and cos (lọ + 0,1) # 0 for some ¢ because 
LA 0, this requires 

Mxy(s) =0 for k<l. 


Thus the infinite matrix My, is strictly lower triangular. 
We now study the situation for different values of l. We first notice that for 
all l, the choices of 


Mis) and Oi are free 


because M” > (s) = 0 by the previous observation, and k? — 1? = 0 for k = I. 
Next we observe that the value Mj41(s) must vanish, because k? — I? Æ 0, 
but M7’ , ;(s) = 0 because of the lower triangularity. Recursively we even 
obtain that 


Mi+1,(s),Mi+3,1(s),.-. vanish. 


On the other hand, for k = l +2, we obtain that 0)42 = 011, and Mi+2,1(s) 
is uniquely specified by Mi(s). Applying recursion, we see that in general 


i1 = baat = O44. =E 
2n 
MS (s) 


Mis2n4(8) = Wr (8 - t2 


(3.4) 


Let us now proceed with the physical interpretation of the result. The 
number l is called the multipole order, as it describes how many oscillations 
the field will experience in one 27 sweep of ¢. The free term Mi(s) is called 
the multipole strength, and the term 0;, is called the multipole phase. 
Apparently, frequency / and radial power k are coupled: The lowest 
order in r that appears is l, and if the multipole strength is s-dependent, also 
the powers l + 2,1+4,... will appear. 

For a multipole of order l, the potential has a total of 21 maxima and 
minima, and is so often called a 2/ pole. Often Latin names are used for the 
2l poles, and they are listed in Table 3.1. 

In many cases it is very important to study the Cartesian (and hence also 
particle optical) form of the fields of the elements. We start with the trivial 
case with k = 1. In this case, the potential is V = M,1cos(¢+ 011) r. For 
0,1 = 0, we obtain V = Mj, - x, which corresponds to a uniform field in 
x-direction. For 011 = 7/2, another important sub-case, we obtain V = 
—M,1-y, which corresponds to a uniform field in y-direction. In both of 
these cases, the reference orbit is indeed a straight line only in the limit of 
weak fields. 
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TABLE 3.1: A list of multipoles 


l Leading Term in V Name 

1 Miai(s)cos( ó--011)r Dipole 

2 Mo22(s)cos(2¢+ 62.2)r? Quadrupole 

3. Mss) cos (3¢ + 033) r? Sextupole/Hexapole 
4 Maa(s)cos (40 + 044) r^ Octupole 

5 Ms,5(s)cos(5¢+65.5)r° | Decapole 

6  Ma,e(s) cos (60 + 05,5) r? Duodecapole 


3.1.2 Quadrupole Fields 


The case k = 2 leads to quadrupoles, and the potential has the form V = 
M3 2 cos (26 + 05 3) r?. Particularly important in practice will be the sub-cases 
05.5 = 0 and 02.2 = 7/2. In the first case, we have 


V = M3,» cos (20) r?° = M22 (cos? ¢ — sin? 9) r? = Ma; (x? — y’), 
and in the second case we have 
V = Ma;cos (26 + z) r? = — Maa sin (24) r? 
= — Mna.» (2sinócos $) r? = — M» 2xy. 


All other angles 02,2 lead to formulas that are more complicated; they can be 
obtained from the ones here by subjecting the x, y coordinates to a suitable 
rotation. This again leads to terms of purely second order. 

Because the potential is quadratic, the resulting fields E or B are linear. 
Indeed, the quadrupole is the only s-independent element that leads to 
linear motion similar to that in glass optics, and thus has great importance. 

In the electric case, one usually chooses 05.5 — 0, thus having V — Ma »(z?— 
y?) and resulting in the fields 


Ez = —2M2 2 "T, E, = 2M2» | y. 


'The fields extend throughout the length of the device, and thus provide strong 
focusing. Different from the case of glass optics, it turns out that the motion 
cannot be rotationally symmetric anymore. If there is focusing in the 
x-direction, there is defocusing in the y-direction, and vice versa. This effect, 
completely due to Maxwell’s equations, turns out to be perhaps the biggest 
nuisance in beam physics; i.e., if one uses piecewise s-independent particle 
optical elements, the horizontal and vertical planes are always different 
from each other. 

To make an electrostatic device that produces a quadrupole field, it is best 
to machine the electrodes along the equipotential surfaces, and utilize the fact 
that if a sufficient amount of boundary information is specified, the field is 
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y 


FIGURE 3.1: Ideal electrodes of an electrostatic quadrupole. 


uniquely determined, and hence must be as specified by the formula used to 
determine the equipotential surfaces in the first place. So in practice, the 
electrodes of an electric quadrupole often look as shown in Fig. 3.1. 
In the magnetic case, one chooses 05» = 7/2, thus having V = — M2 2- 2xy 
and resulting in 
By = 2M22 "Y, By = 2M2» "T, 


and looking at the Lorentz forces that a particle moving mostly in s-direction 
experiences, we again see that if there is focusing in x-direction, there is 
defocusing in y-direction and vice versa. 


3.1.3 Sextupole and Higher Multipole Fields 


To study higher orders in k, let us consider the case k = 3. For 05,3 = 0, we 
obtain 


V = M3 3 cos (39) r? = Ma, (cos? d — 3cosósin? p) r° = M3,3 (£? — 3ay’). 


In this case, the resulting forces are quadratic, and are thus not suitable 
for affecting the linear motion; but we shall see later that they are indeed 
very convenient for the correction of nonlinear motion, and they even have 
the nice feature of having no influence on the linear part of the motion. 
Another important case for 03,3 is 63.3 = 7/2, in which case one can perform 
a similar argument and again obtain cubic dependencies on the position. 

For all the higher values of l, corresponding to octupoles, decapoles, 
duodecapoles, etc., the procedure is very similar. We begin with the addition 
theorem for cos(/¢) or sin(l¢), and by induction we see that each consists of 
terms that have a product of precisely | cosines and sines. Since each of these 
terms is multiplied with r’, each cosine multiplied with one r translates into 
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FIGURE 3.2: The s-dependence of the multipole strength. 


an z, and each sine multiplied with one r translates into a y. The end result 
is always a polynomial in x and y of exact order I. 

Because of their nonlinear field dependence, these elements will prove to 
have no effect on the motion up to order / — 1, and thus allow us to selectively 
influence the higher orders of the motion without affecting the lower orders. 
And if it is the crux of particle optical motion that the horizontal and vertical 
linear motion cannot be affected simultaneously, it is its blessing that the 
nonlinear effects can be corrected order-by-order. 


3.1.4 s-Dependent Fields 


In the case where there is no s-dependence, the potential terms that we have 
derived are the only ones; under the presence of s-dependence, as shown 
in eq. (3.4), to the given angular dependence there are higher order terms 
in r, the strengths of which are given by the s-derivatives of the multipole 
strength Mj ;. The computation of their Cartesian form is very easy once the 
Cartesian form of the leading term is known, because each additional term 
just differs by the previous one just by the factor of r? = (x? + y?). 

In practice, of course, s-dependence is unavoidable: the field of any particle 
optical element has to begin and end somewhere, and it usually does this by 
rising and falling gently with s, entailing s-dependence as seen in Fig. 3.2. 
This actually entails another crux of particle optics: even the quadrupoles, 
the “linear” elements, have nonlinear effects at their edges, requiring 
higher order correction. The corrective elements in turn have higher order 
edge effects, possibly requiring even higher order correction, etc. In practical 
terms, charged particle optical systems are designed in such a way that the 
effect of the higher order field is smaller than that of the lower order field, 
which ensures that the iterative process converges. 

Without s-dependence, the case | — 0, corresponding to full rotational 
symmetry, is not very interesting since there will be no field left. This becomes 
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FIGURE 3.3: Scalar potential (solid), longitudinal (dashed) and radial 
(dotted) field distribution along the s-axis of a single (left) and dual (right) 
step in potential of a rotationally symmetric lens. 


clear if we rewrite eq. (3.4) for the case | = 0, which is 


M&S” (s) MS (s) 
Mon.o(s) = e) = HT UNES 3.5 
957 m CDQo ^ (D C4) i 
(2n) 


When Mg (s) = 0, we obtain that V = Mo,o, which is independent of s and 
r. If we consider s-dependence, it actually offers a remarkably useful effect. 
While there is no r-dependence in the leading term, the contributions through 
the derivatives of Mo,o(s) entail terms with an r-dependence of the form r?, 
r^,.... Using eq. (3.5), we obtain the Taylor expansion of the potential, which 


is 


—;— M$} (s)r#— (3.6) 


Of these, the r? term will indeed produce linear, rotationally symmetric 
radial fields and lead to effects similar to those in the glass lens. In practice 
these fields are not very strong (proportional to Me?) (s), compared to Mie (s) 
for the longitudinal field) and restricted to regions where the potential changes 
and are used in so-called weak focusing. In practice, potential changes often 
occur as transitions between regions of constant potential. This can be done 
as a single step as shown on the left of Fig. 3.3, or as a dual step as shown on 
the right of Fig. 3.3, where the latter has the advantage that no net change 
in potential occurs. 
'The resulting fields are given by 


(1) TR. 1 (yy 
E,(r,s) = — M, -M, 
(r,s) o,o (5) + 1 40,0 (s)r (an? M 0.0 (5)r^ + 
js 35. 
E,(r,s) = 5 Mo.o (5)r - aM o (s)r3 + 


The magnetic field components Bs(r,s), B,(r,s) take the same form. Fur- 
thermore, there are usually quite large nonlinearities, and altogether these 
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FIGURE 3.4: Reference orbit of a bending magnet. 


devices are used mostly for low energy, small emittance beams like those 
found in electron microscopes. 


3.2 Fields with Planar Reference Orbit 


In the case of the straight reference orbit, we saw that Maxwell's equations 
entail a very clean connection between rotational symmetry and radial po- 
tential. As one may expect, in the case of a non-straight reference orbit, this 
is no longer the case. In this situation, Maxwell's equations have a rather 
different but not less interesting consequence as long as we restrict ourselves 
to the case in which the reference orbit stays in one plane. 


3.2.1 The Laplacian in Curvilinear Coordinates 


As it turns out, in this case the arguments to express the Laplacian in the 
new coordinates are similar to that in cylindrical coordinates. Let us assume 
that the motion of the reference particle is in a plane, and that all orbits that 
are on this plane stay in it. Let R(s) be the momentary radius of curvature 
as shown in Fig. 3.4. 


Then we have a situation very similar to cylindrical coordinates r, $, z 
centered around the momentary origin of R(s). In fact, setting h(s) = 1/ R(s), 
the particle optical coordinates x, y, s correspond to the cylindrical ones in 
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the following way: 


As we recall, in cylindrical coordinates the Laplacian had the form 


V( O ma av jd 9 19V PE 4 
ERST Onde Or r ð Nr Ob O0z2' 


So we may expect that in particle optical coordinates, we in fact have 


1 ð OV 1 ð 1 OV 0?V 
UE) E Tim (ccena m a): The Os (Et) Tm 


A careful analysis based on the chain rule and determining the proper Jacobian 
reveals that this is indeed the case. The calculations are rather mechanical 
and not particularly interesting, but very involved [48], and we skip them for 
the purposes of this discussion. 


3.2.2 The Potential in Curvilinear Coordinates 


For the potential, we again make an expansion in transversal coordinates, 
and leave the longitudinal coordinates unexpanded. Since we are working now 
with x and y, both expansions are Taylor, and we have 


V=V(z,y, 8 xS S ain) jm (3.7) 


k=0 1=0 


This expansion now has to be inserted into the Laplacian in particle optical 
coordinates. Besides the mere differentiation, we also have to Taylor expand 
1/(1-4 hz) : 

ixi (ha) + (hz)? — (ha)? + 
After gathering terms and heavy arithmetic, and again using the convention 
that coefficients with negative indices are assumed to vanish, we obtain the 
recursion relation 


ak +2 — — Qj — khay 1 + kh'ay 44 — Gk+2,1 — (3k + 1) haya 
= 3khay 142 = k (3k e 1) hapa s= 3k (k hago 1) h?ay 2112 
— k (k — 1} h5ag 11 — k (k — 1) (k — 2) Pa sia. (3.8) 


Although admittedly horrible and unpleasant, the formula apparently has the 
coefficient of highest total order k + l + 2 on the left hand side, and thus 
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recursively allows the calculation of coefficients. Indeed, the terms a;,o(s), 
ax,1(s) can be chosen freely, and all others are uniquely determined through 
them. 


To study the significance of the free terms, let us consider the electric and 
magnetic case separately. In the electric field, in order to ensure that orbits 
that were in the plane stay there, there must not be any field components in 
the y-direction in the plane corresponding to y — 0. Computing the gradient 
of the potential, we have 


xk! ak 
Ez (2,y=0)=— 5 aro Ey (x,y 50) =- ana =0, 
k i k l 


and looking at Ey, we conclude that aj; = 0 for all k. So the terms ajo alone 
specify the field. Looking at Ey, we see that these are just the coefficients that 
specify the field within the plane, and so the midplane field determines 
the entire field. Furthermore, looking at the details of the recursion relation 
(3.8), it becomes apparent that all second indices are either l or | + 2. This 
entails that as long as aķ,ı terms do not appear, also aj, 3, à5,5, .. . terms do not 
appear. Indeed, the resulting potential is fully symmetric around the plane, 
and the resulting field lines above and below the plane are mirror images. 


In the magnetic field, the argument is rather similar. Considering the 
fields in the plane, we have 


gk-l 


k 
x 
By, (x,y = 0) =— 9g B, (x,y = 0) =-9 ao =0. 
k j k ; 


In order for particles in the midplane to stay there, we must have that B, 
vanishes in the midplane, which entails a;,9 = 0. So in the magnetic case, the 
coefficients aj, specify everything. These coefficients, however, again describe 
the shape of the field in plane, and so again the midplane field determines 
the entire field. In the magnetic case, the potential is fully antisymmetric 
around the plane, and again the resulting field lines are mirror images of each 
other. 


To summarize the findings, 


Electric field: akı =O for all k, azo specify everything. 
Magnetic field: a, = 0 for all k, ap, specify everything. 


To conclude, we note that it is possible to extend the entire discussion also to 
cases where the motion is not confined to a simple midplane. The derivations 
connected to this most general case become exceedingly complicated [49, 48] 
and go beyond what is appropriate for this book. 
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3.3 The Equations of Motion in Curvilinear Coordinates 


There are a variety of methods to derive the equations of motion in curvi- 
linear coordinates with the arc length s as the independent variable. It 
is conveniently done in the Lagrangian picture, in which one first expresses 
Cartesian variables by curvilinear coordinates and rewrites the Lagrangian. 
Then one proceeds to a Hamiltonian through a Legendre transformation in 
the common way. In the Hamiltonian picture, it is then possible to perform a 
change of independent variable from t to s while maintaining the Hamiltonian 
structure [49]. 

While very illuminating, the Lagrangian-Hamiltonian mechanism is too 
involved for our purposes, and we thus follow a more straightforward, clas- 
sical way that leads to the same canonical equations of motion. For simplicity, 
we also restrict ourselves in that the reference orbit is allowed to bend in only 
one plane. 


3.3.1 The Coordinate System and the Independent Variable 


As a function of the arc length s, we first define the momentary curvature 
of the reference orbit as h(s). If the curvature is nonzero, the radius of cur- 
vature is then given by R(s) = 1/h(s). We begin by studying the bend angle 
that the reference orbit experiences as we move from position s to position s. 


We have "s P 
cfu fe hf ies T 


As described in eq. (1.1), in Cartesian coordinates, the equations of motion 
have the Lorentz force form 


<= F=q(£+0x B) =Ze(E+oxB), 


where E and B are the electric and magnetic fields, v is the velocity, and 
q = Ze is the charge of the particle. Since the left hand side of the equations 
of motion contain momentum, it is often useful to express the velocity in terms 
of the momentum. From eq. (1.5), 

Om p 

vV = — = 


i u 
dt V p? + m2c? 


which allows to maintain only the momentum 7 in the equations of motion. 
For the purpose of our derivation, we rewrite the equation as an integral 
equation: 
(8) | $ 
Ko-ms-[ Fos-muo-[ Fas. 
t(s) E 
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where we have used t’ = dt/ds, and it is worthwhile to remind ourselves that 
the force F'(8) still depends on both # and P. 

As we have progressed from s to 5, the orientation of our locally attached 
particle optical coordinate system has changed. It was rotated by the angle 
a of eq. (3.9). By using the rotation matrix 


cosa 0 sina cos T h(s)ds 0 sin T h(s)ds 
R(s) = 0 1 0 |= 0 1 0 ; 
—sina 0 cosa — sin f h(s)ds 0 cos (^ h(s)ds 


we have the momentum in the new rotated local coordinates pi(3) = (pz, Py, ps) 
as 
qm) = R(s) - pts) 
cos[^h(s)ds 0  sinf;h(s)ds " 
= 0 1 0 i (z) + / Feas) é 
— sin fe h(s)ds$ 0 cos J^ h(s)ds i 
where we have expressed the last line in the integral form. In order to obtain 


the rate of change of the momentum pı, we differentiate with respect to s. 
Noting that 


L sin | EERI f A (5)ds, 
S S E] 


d Ei S 
£ cos f h(s)ds = —h(s) sin f h(s)ds, 


evaluate at 5 = s and obtain 


gs) LO + R'(s) (so +f Feas) " 
0 0 h(s) 
= F(s)t' + 0 0 0 p(s). (3.10) 
-h(s) 0 0 


Note that the first term depends on the actual forces and the factor t’ accounts 
for the fact that we went to the arc length s as an independent variable. 
The second term is a pseudoforce due to the fact that we are located in a 
rotating frame. Indeed, for h — 0, we obtain the conventional result. We also 
note in passing that if we were to allow out-of-plane motion of the reference 
orbit, then the matrix R would depend on two curvatures. Unfortunately, in 
this case an additional complication arises from the fact that rotations around 
different axes do not generally commute [49]. 
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FIGURE 3.5: The curvilinear coordinates in the plane of the reference 
orbit. 


Next we make an observation regarding the rate of change at which dis- 
tances are covered at different positions x. Looking at Fig. 3.5, we observe 


dL R+a 


gu zd e 
ds R Ds 


Using this and the components of the momentum, we obtain 


dx dL dx dx — De 
E — as dL = (1 ate ha) = (1 5p WES 
dy dL dy _ dy _ Py 
m wu rp NU i 


For the time-of-flight, we consider the traveling distance divided by the 
velocity, and we obtain 


dt 1 [|(dzN? /dy\? | fdLN? 1 Pi + Py 
ie (=) «(3 «(S x lu a 
1 
= = (1+ ha) 7 (3.12) 


where p = 4/p2 + pz + p? has been used. 

Altogether, we have so far obtained the equations of motion in local coor- 
dinates with s as the independent variable. From there to the particle optical 
variables, only a small step is left. We remind ourselves that the particle 
optical coordinates are (z,a, y, b,1,6] as listed in Table 3.2, where pp and 
to are total momentum and time-of-flight of the reference particle; refer to 
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TABLE 3.2: Optical coordinates 


Coordinate Phase Space 

x Horizontal Position 

à = pz /po Momentum Slope 

y Vertical Position 

b = py/po Momentum Slope 

l — &(t— to) Longitudinal  Time-of-Flight like Variable 
ô = (K — Ko) / Ko Energy Deviation 


eqs. (2.1) and (2.2). Likewise, the subscript 0 will be used below to indicate 
the respective quantity of the reference particle. 

In order to study relativistic effects, it is advantageous to introduce the 
relativistic measure 7, the ratio of kinetic energy to rest mass energy 


|| Kol +ô) — ZeV ZeV 


MC 


where m is the rest mass. The quantity V is the change in energy that is 
incurred due to the passage through electric fields; it is given by 


y- - | Bæn, s, t) - üdt, (3.14) 


where v is the velocity, depending on position and time, of the orbit under 
consideration. In case the electric fields are time independent, the quantity V 
is merely the common electrostatic potential, which depends on the position 
coordinates (x, y, s). For time dependent fields, the quantity V explicitly de- 
pends on the specific time dependent orbit taken, which is of importance for 
the study of dynamics in RF cavities as discussed in Chapter 10. 

Since mc? represents the total energy, we have 


1 
= \/1 — v2 /c? 


Using these, we also have 


=1+7. (3.15) 


1 vant _ vn2@t+n) 


~ (+n) lcm 1-7 


v Ee 2 X 
c 
es T = = ViCi), and 2=m(1+n). (3.16) 
mc me U 
As a first step, using eq. (3.12), we express the rate of change of the particle 
optical variable | = «(t — to) in terms of particle optical quantities: 


di 1 1 
= ant) =0 |= a+- >] 
ds U 


1 
(1+ he) 222o zi RARE [22s cH LN 1 Zo Gai) 
Po V Ps Vo 1+ 0 Ps Vo 
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where the relation tj = 1/vo is used because of pso = po. The term po/Ds 
appearing above can be expressed by the optical coordinates a and b as 


Po _ Po = (4 2 nk 
= ———————|—5-a-b 
Pei apepre = pa -A20 
2 —1/2 
& p -a — i) (3.18) 
no (2 + jo) 
Next, by applying the Lorentz force law to eq. (3.10), we obtain 
0.0 h 
d 7 5 E 1 D 
5 (RB B) Poria o o0 0 ess 
ds \ po po Po Po —h 0 0o / Po 
[^ 51 Ü S 2 
= Ze(B+ 0x B) E +n (2, 0-2). (3.19) 
Po Po Po 
We here introduce the magnetic rigidity Xm and the electric rigidity Xe, 
P pu 
Xm = = 


Ze Xe — Ze 
As we will see below, the magnetic and the electric rigidities describe directly 
to what extent the magnetic and the electric fields influence the geometric 
motion of the particles. The first term of eq. (3.19) can be expressed by using 
Xmo and Xeo as 

Ze 
Po ~ poto 
From eq. (3.12), the factor vot’ in the electric term can be written as 


2 $ EB B 
Ze(E «x B) L (E vx B) vot! = uot! e £x —t. 
X 


e0 Xm0 


1 
ai cS AZ = ee) PU (p quy tT 
v Ps Po V Ds 1+ 10 Ps 


Similarly, the factor vt’ in the magnetic term can be written as 
z = = 
of’ =" (the) & =F (1 nz) P = (1 + he) 22 
vU Ds p Ds Po Ps 
where v/v = p/p is used because of v || p. Thus, 


m at d^ 1 E p B 
Ze(E - $x B) — = (1 + ha) zu. A RU eee ya ete e LA 
Po 1+ 0 Xe0 Ps Po Xm0 Ps 


Continuing from eq. (3.19), we obtain 
d (= Py 3 
ds \ po’ po Po 


1 EB p B $ E 
Sige aw T +n (20-2). 
1+ "no Xeo Ds Do  Xmo Ps 


Fields, Potentials and Equations of Motion 65 


Finally, we consider the change of the last variable in the particle optical 
coordinates, 6. Since by definition it describes the deviation of initial kinetic 
energy of the particle of interest, we have 


à — 0. (3.21) 


Note, however, that since 6 describes the deviation from the kinetic energy of 
the reference particle before the system, in case there is net acceleration or 
deceleration along the orbits, it may be desirable to periodically absorb the 
accumulated amounts in the orbit dependent path integral for V in eq. (3.14) 
into the variable ô. In the case the motion was merely through a static electric 
field, this will entail that ó will depend on positional variables. In the case of 
full time dependence as in the motion in RF cavities discussed in Chapter 10, 
after the renormalization, ó will depend on all particle optical coordinates. 


3.3.2 The Equations of Motion 


By observing that p/po = (a,b, ps/po), from eqs. (3.11), (3.17), (3.20) and 
(3.21), we obtain the equations of motion in particle optical coordinates: 


z' -a(14 hz) P9. 

Ds 

1-75 E, B; B s 
d= (1+he)| DELE Se Se) Be 

1+ o Xeo Ps Xm0 Ps Xmo Po 
y -b hz) P2, 

Ds 

1 E Bz B, 
bf = (1+ hz) Han a, 

T No Xe0 Ps Xm0 Xm0 Ps 


[mn 


je ja +n ELS | um 
T No Ps vo 


à =0, (3.22) 


where we remind ourselves of the following abbreviations from eqs. (3.13) and 
(3.18), 


ZeV 2+ Ru 
n= m (1+8) - — >, 2 (agin ae) 
mc Ds — Vio (2 rio) 

and eq. (2.1), 

‘Yo 
1+ 0 
Note that the factor «/vo in the equation for l^ can be expressed in terms of 
no instead of yo using eq. (3.15): 


K = —VO 


K 1+ no 


vo — 2+ 
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We observe that the horizontal (x-a) motion is affected mostly by E, and 
B,, and the vertical (y-b) motion is affected mostly by E, and Bz, and it is a 
direct consequence of the Lorentz force law. When a longitudinal component 
of the magnetic field B, is present, it acts to mix the horizontal and the 
vertical motions through the B, dependent terms in a’ and b’, which is even 
a linear effect. This phenomenon is readily observed in the spiral motion of 
a charged particle moving through a solenoid if any transversal component of 
the momentum exists, which is described by a and b in our case. 

A careful analysis of the equations of motion reveals that indeed if all the 
particle optical coordinates are small, so are their derivatives defined through 
the equations of motion; indeed, the system is weakly nonlinear. 


Chapter 4 


The Linearization of the Equations 
of Motion 


In order to develop a matrix theory of particle optics similar to the Gaussian 
theory in glass optics, we have to linearize the equations of motion. This 
procedure is rather similar to other linearizations in physics; in particular, 
it is very similar to the study of so-called small oscillations in mechanics. 
Since the solutions of linear systems depend linearly on the initial conditions, 
indeed the resulting transfer maps will be linear as needed. It is worth noting 
that, although a 6 x 6 matrix is required to describe the linear motion, only 
blocks of 2 x 2 and 3 x 3 are needed for decoupled linear motion. 

We begin the actual process of linearization with the linearization of the 
fields, which corresponds to quadratic potentials in eq. (3.7). We begin our 
discussion with the case in which the potentials on the reference orbit vanish, 
which describes the situation of electric and magnetic multipoles as well as in 
deflectors. The case of electric and magnetic lenses do require the presence of 
potentials on axis, and they will be discussed in detail below. 

In the electric case, let us assume that there is no potential on axis, i.e., 
ao,o = 0, and that in the midplane, we have 


Es = — Ezo (1 + nez). 


Because of the recursion relation for fields, eq. (3.8), we obtain an out-of-plane 
expansion of 
Ey = Eso(h + ne)y, 


as well as an electrostatic potential 
1 2 B 
V(a,y) = Esox + 5 Eso(nez = (h + Ne) y ), 


which is chosen in such a way as to vanish on the reference orbit. 
In the magnetic case, let the midplane field be given by 


B, = Byo(1 + mz). 
Due to the recursion relation, we must then have 


By = Byonyy. 
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Before we even discuss linearization, let us consider the zeroth order of the 
motion: if the system is supposed to be origin preserving, then we must have 
from the equation of motion for a’ in eqs. (3.22) that 

E B 
x0 + yO -— 
Xe0 Xmo 
which in a natural and expected way couples the constant parts of the fields 
with the curvature of the reference orbit. 

Now we begin our process of linearization of the equations of motion (3.22). 

It is easy to see that 


(4.1) 


, 


x =a, y =b. 


We also obtain 


— = 1 + 6 == 2 Esox, 
No nome 
and after more complicated expansions 
1+ Z 
4 = 1 + a a= xO”, 

1+ m +m (lc 0) me? 
2+ Z 
Ze on SL ee ZE Bur 
2+ no 2+ no (2 + go) mc? 


Similarly, by using v1 +u —1 1+ u/2, 1/(1 + u) 21 1 — u for small u, we 
obtain 


p. _ | nĒ+n) 5 pg 
Po no (2 + no) 
1 1/1 1 Z 
ege que E up D ey 
2 2 -F no 2 m 2-m]o/ mc 
1+ 1 Z 
=] 1 2E no € LIRE e Gn 
2+10 — mo(2-t mo) mc 


Note that the symbol “=,” means we are keeping terms up to first order. 
After lengthy similar arguments, we also conclude 


1 1 Ze K 
v= je- — é +. matl Es s — 
j (1+ 10) 2-- o) — no (1+ 10) 2 no) m? 7^ | v 
1+ 1 Ze 1 
—1— —— —3 ro x | v + z0, 
2-Fno  mo(2-- mo) mc (2+ no) 
as well as 
E; B Ez 1 1 Z 
a! =) — 4 h? + TE, + Am + : 2 at = Exo k 
Xeo Xmo Xeo (1470) no (2 F No) me 
Ez 1 1+ 
+ 0 : no ô, 
Xeo (1+ no)” | 2 +70 
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where the relation (4.1) is used, and 


Ez B 
b = E o (h+ ne) + Pen y. 


To summarize, we have the linearized equations of motion as a set: 


X = 1 a, 
Ex B Ezr 1 1 Z 
a —— hip ea qo aped h 4 E QE A x 
Xeo mo Xeo (14-mo) no (2 F no) mc 
Ez 1 1+ 
ER 0 : No ó, 
Xeo (14- no) | 2+ lo 
y' =ı b, 
Ez 
y= | 9 (h 4- ne) + — no] v 
e m0 
1+ 1 Z 1 
be =, — h "lo =e T + z0, 
2+0  mno(2-- mo) mc (2 + no) 
5 =0. (4.2) 


Now that the equations of motion have been linearized, they have to be 
studied for a variety of different cases. We begin with the simplest case. 


4.1 The Drift 


In the case of the drift, all fields are 0, and h = 0, so the linearized equations 
of motion have the form 


= a, a =U, 
!^—b b 
1 
'=——6, $6 -0, 
(2 + no) 


where of course only the last equation is of any real interest. These equations 
are trivial to integrate, and we obtain 


Tf = Ti + qL, af = Qi, 
yf = yi + BL, by = bi, 
L 
lg ôi + li, of = ĝi, 


(2 +m)” 
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where L is the drift length, and they can be written in matrix form as 


xe 1L0 0 0 0 Tj 
af 010000 Qj 
vy | |001L00 Yi 
by | [| 000100 b; |? 
ly 00001 D li 
Of 000001 ôi 
where 
= L 
(2+ no)” 


First, we observe that, as in the case of glass optics, the determinant is 
unity. We also observe that the matrix can be grouped to three blocks, corre- 
sponding to the x-a (horizontal), the y-b (vertical) and the l-ô (longitudinal) 
motions. So, in the linear approximation, those three submotions in the drift 
are decoupled, without having any mixing. This allows us to study the entire 
motion in each direction conveniently independently. Later, we often study 
the motion of a system in decoupled submotions. 

It is worthwhile to note that, when we take account of nonlinearity, even 
the drift motion is no longer simply linear, which may sound striking. This 
can be seen in the equations of motion (3.22), and the nonlinear effect comes 
from the factor po/ps, which contains the second order contributions of a and 
b. This effect is called the kinematic correction, and it also is responsible 
for nonlinear mixing of submotions in different directions even for the drift. 


4.2 "The Quadrupole without Fringe Fields 


More interesting is the case of the quadrupole. Since the reference orbit 
goes straight, we have h = 0. 


4.2.1 The Electric Quadrupole 
For the electric quadrupole, from Section 3.1.2, we have 
V = M2,2c0s (29) r? = M22 (a? — y?), 


and 
Ez = —2M2 2 fX, E, d 2M2» "Y, 


while B = 0. If M22 > 0 for the positive charge beam, the field acts to 
focus the beam in the horizontal (x) direction, and defocus in the vertical (y) 
direction. The field description above corresponds to the case in the general 
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form that the constant part Ezo of E, is 0 and the factor of the linear term 


in z is Erone = —OE,/0x = 2M22. The equations of motion have the form 
2M: 
xv’ =a, a! = mg = wg, 
Xe0 
2M: 
y =b, b= y= wy, 
Xe0 
1 
|! = —— — 356, ð — 0, 
(2 + no) 
where 


2M22 
amg =e, 
Xe0 


Apparently we have sine-cosine solutions in the horizontal plane, and sinh- 
cosh solutions in the vertical plane. For the quadrupole with the length L, we 
have 


sinwL i 

zf = x,coswL + a; EE Gf = —wrj;sinwL + a; coswL, 
sinhwL 

yf = yi coshwL + bi aes » bs = wysinhwL + b; coshwL, 

w 
L 
lg = — îi + li, Of = ĝi. 
(2 + no) 


This can be written in matrix form as 


Tf cos(wL) sin(wL)/w 0 0 0 0 Ti 
af —wsin(wL)  cos(wL) 0 0 0 0 ai 
y| 0 0 cosh(wL) sinh(wL)/w 0 0 Yi 
b, 0 0 wsinh(wL) cosh(wL) 0 0 b; |? 
ly 0 0 0 0 1D|l 
5 0 0 0 0 DOE JA, 
where 
2M. L 
w=, 2 and D= MEE 
Xe0 (2+ no) 


Similar to the case of the drift, the matrix can be grouped to three blocks, 
namely the horizontal, the vertical and the longitudinal motions. Again, this 
holds while we limit ourselves to the linearized motion. We observe that, 
as in the case of glass optics, the determinant is unity. Furthermore, note 
that if M22 < 0, w is imaginary. In this case, the x- and y-planes exchange 
their roles, the quadrupole becomes focusing in the vertical (y) direction and 
defocusing in the horizontal (x) direction. To see the focusing action in the 
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similar style to the matrix form of the thin glass focusing lens, eq. (2.4), 
we consider a thin approximation of the quadrupole. While maintaining the 
integrated field strength, that is represented by 2M». L, we make L — 0. 
Then, we have 


soam) enc. eee a) 


This corresponds to a thin focusing lens with the focal length f as 1/f = 
(2M3.2/ Xeo) -L= "n : L. 

In various cases that will be studied in the following sections, we will observe 
a harmonic oscillator motion similar to the z plane motion in this section. In 
such a case, if the system is short it behaves like a thin focusing lens, and the 
focusing power can be obtained as 1/f = w? - L using the angular frequency 
w of the system, in the same way discussed here. 

It is also worthwhile to briefly mention the case of fringe fields. In this case, 
M22 changes as a function of s. The resulting ordinary differential equation 
(ODE) is still linear, which entails that the result can be written in matrix 
form, but in most cases is impossible to solve it analytically. 


4.2.2 The Magnetic Quadrupole 
In the case of the magnetic quadrupole, we have 
VW = —2 Mə 2x "Yy, DB, = 2M2 29, By = 2M2 2%, 
while E = 0. This corresponds to the case in the general form that the constant 


part Byo of B, is 0 and the factor of the linear term in x is Byonp = OB,/Ox = 
2M» 2. This results in the linear equations 


Pisces fe 2M2,2 _ 2 
T =a, a —— £ = -—wz, 
Xmo 

2M: 

y —b == y=u%y, 
Xm0 

1 
BS — ——30, ð — Q0. 
(2+ 70) 


Similar to the electric case, we have introduced 


2M2» 
w= ; 
Xm0 


and the resulting transfer matrix is the same as in the case of the electric 
quadrupole. 
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constant field 


( Cnm) 


FIGURE 4.1: A homogeneous magnetic dipole. 


4.3 Deflectors 
4.3.1 The Homogeneous Magnetic Dipole 


The next particle optical element we want to study is the magnetic dipole, 
consisting of a homogeneous, hence constant, magnetic field in the y-direction. 
We consider that the dipole element acts to bend the reference orbit by the 
bending angle ¢ with the bending radius Ro, so the curvature is h = 1/Ro. 
We also note that from eq. (4.1), h = Byo/Xmo. In terms of the quantities 
describing the linearized fields, we have 


Byo = constant, ny = 0, 
while E = 0. Keeping in mind magnet design, such a field can be obtained 


very schematically as shown in Fig. 4.1. 
Let us now consider the equations of motion; we obtain 


1+ 
x’ =a, a’ — —h?z-h ux 
2+ No 
y =b, he 
1+ 1 
U =-h Br + —— ó, 4-0. 
2+ No (2+ no) 


First we observe that if we choose h = 0, we obtain a’ = 0, and we have the 
same situation as in the case of a drift. But even for the case of h Z 0, the 
motion of the y-direction behaves simply like a drift, and we always have 


yp = yi + bi L, by = bi, 


where L is the arc length of the reference orbit in the dipole and L = Rog. 
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Next we observe that as always, 6 stays constant, and hence in the equation 
for a’ plays the role of a parameter, making the differential equation inhomo- 
geneous. Finally we observe that since | does not couple into the horizontal 
or vertical motion, we can solve the equation for l after the horizontal motion 
is analyzed by a mere integration. 

In order to solve the horizontal part of the motion, we first solve the ho- 
mogeneous part of the differential equation, which has the form 


and we obtain as a solution 
1 
xf = zi coswh + —ajsinwL = zi cos 9 + Roa; sin ¢, 
w 
1 
aş = —wa;sinwl + a; coswL = Rt sin ó + aj cos $, 
0 
where we have used the angular frequency w, 
1 
Ho. 
Altogether, we have a behavior not much different from a focusing quadrupole. 


In order to treat the inhomogeneity, we perform a so-called “variation of 
parameters," that is we make an ansatz of the form 


w=h= 


1 
x (s) = %;(s)cos¢ + Roa; (s) sin d = zj (s) cosws + —a; (s) sinws, 
w 
1 
a(s) = — Rt (s) sing + à; (s) cos ó = —w; (s) sinws + à; (s) cosws, 
0 


where now the original parameters Z;, à; are viewed as functions of s. Inserting 
into the differential equation, we obtain the following condition: 
1 
T; (s) cosws + —a; (s) sinws = 0, 
w 


ia 
—w7; (s)sinws + a; (s) cosws = A = hr 
T No 


using the abbreviation A for the right hand side in the second equation. 
Rewriting in matrix form, this reads 


Ce rS 


Multiplying with the inverse matrix and integrating, we obtain 


LE 


f 1 1 
Ti (s) = J (- sinus) Ads + xj = — A (cosus — 1) + vi, 
0 WwW 


W 


z 1 
a; (s) = f (cosws) Ads + a; = —Asinws + aj. 
0 w 
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So the complete solution of the inhomogeneous part has the form 


1 1} 1 
x(s)= [A (cosws —1) zl cosws + — | sinus + «| sinws 
w w |w 
1 . 1 
= x; cos 9 + —a;sinó + — (1 — cos), 
W W 
1 ; le bs 
a (s) = —w | — A (cosus — 1) + z; | sinws + | CA sinus + a;| cosws 
W W 


1 
= —uajsin $ + a; cos o + —A sin Q. 
w 


Thus we obtain 


12 
tf = vi cos ó + Roa; sing Ro (1 — cos) 5 M y, 
T 0 
: in ó + a; cos ó + si QE 
Qr = —— Ti Sın Qi in i. 
f Ro 2 + o 


Finally we have to study the case of the time-of-flight part, which as we 
said before can be obtained by mere integration. We have 


L 
1+ 1 
= | E ARL 55| ds +l; 
0 2+ 10 (2+ 10) 
i 14 " 12 R 
= — — 4 cos — — a; sin —— 
Ü 2 + no Ro Ro 2+mo 0 
ido 1 
- (1-85) ( 2) + 59i | ds + li 
Ro 2+ No (2+) 
1+ 0 S 1+ 0 
— x; sin — + Roa; cos — 
| 2+ no Ro 2 Ro 
Rod 
1 1459 V? 1 
+ To Ro sin —— Ôi Ho is + 5 Oi8 +1; 
2+n Ro 2+ 10 TEST ] 


1+ TE 
=— T rs sing + 5 ™ Rya; (cos à — 1) 


tm Fn 
1+ n a no 

+ R ôi — Rodd; + lj. 
(=) o (sin 9) Jum oP 


As a result, we see that all the final coordinates indeed depend on all initial 
coordinates in a linear fashion, and hence the relationship can be written in 
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terms of a transfer matrix. The general shape of this matrix is now 


LF cosó Rosind 0 0 0 (a|d) dà 
af —sin¢d/Ro cosé 0 0 0 (ald) ai 
ue | _ 0 0 1 Rọ 0 0 bile as 
by 0 0 0 0 0 b; 
Ly (ll) Ua) 0 METDSEP 
ó, 0 0 0 0 1 ó; 


where Ro and ¢ are the bending radius and the bending angle, and the ab- 
breviated matrix elements are 


1+ 1+ 
(10) = —(la) = Zp Ro (1 = cos), (al) = -(le) = 5 Sing, 


n lY 
0 + "no i 

ó — ( ) sing| . 
2+ No 2+ No 

We observe that the determinant of the matrix is unity. Note that, while 
the y-b (vertical) motion is decoupled, the x-a (horizontal) and the [-ó (lon- 
gitudinal) motions are coupled. Specifically, the z-a motion depends linearly 
on ô. 

'The homogeneous dipole magnet we have considered so far has edges that 
are perpendicular to the reference orbit. So the region where the magnetic 


field is active corresponds to a sector of a circle, which is the reason such a 
magnet is often referred to as a sector magnet. 


(l8) = —Ro 


4.3.2 Edge Focusing 


When the reference particle enters and exits a sector dipole magnet, the or- 
bit travels perpendicular to the entrance and the exit edges. When the magnet 
edge is not perpendicular to the reference orbit, additional focusing and de- 
focusing effects act on the beam, which is called edge focusing. The angle 
difference from the perpendicular sector magnet case is called the edge angle. 
Edge focusing is frequently used on purpose to modify the linear properties 
of the motion, or as a consequence of convenience in manufacturing since it is 
particularly simple to use a rectangular shape for the magnet, which leads to 
the so-called parallel-faced dipole. We now study the effects of edge focusing 
using the matrix form, and compare the result with the sector dipole. 

We measure the edge angle o such that the rectangular dipole would have 
positive edge angles. So, when a > 0, a particle that enters or exits the 
magnet at a positive z location experiences a lesser amount of the bending 
magnetic field compared to the reference particle. Compared to the sector 
dipole magnet, this means that the edge line tilts inward for positive x as 
shown in Fig. 4.2. 
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Entrance 


FIGURE 4.2: Entrance and exit edge lines of a dipole magnet with edge 
angle a. 


When the edge angle a is sufficiently small, the effect can be approximated 
as an impulsive effect that changes only the horizontal and the vertical angles 
of the particle orbit but does not affect position nor the longitudinal motion. 
The impulsive style of treating such an effect is called “kick,” and the kick 
approximation is sometimes used when studying beam optical systems in lin- 
ear approximations in a similar way as it is in glass optics in the use of the 
thin lens. 

As we will explain in the following, in the kick approximation, the effect of 
the edge angle a for the homogeneous dipole magnet acts to change only a 


and b via 


POSU. jsp SO (4.4) 
Ro Ro 


where Ro is the bending radius of the homogeneous magnet, and 


af = ai + 


1 By 
Ro X^ 


The same expression applies to both the entrance edge and the exit edge. 
Using the abbreviation 
T = tan o/ Rio, 


the matrices of the horizontal (x) and the vertical (y) kicks by the edge angle 


qa are described as 
~ea {1 0 ved — 1 0 
M; =(; Lp M; = -T 1)}° 


Recalling the situation of thin glass lenses, when œ > 0, the vertical kick 
acts to focus the beam, and the horizontal kick acts to defocus. Combining 
the horizontal and the vertical kicks, the effect of the edge is that of a thin 
quadrupole of the strength —T, where always one of the directions experiences 
focusing, and the other defocusing. When the sign of a is opposite, the effect 
also becomes opposite. 
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FIGURE 4.3: Mechanism of edge focusing for the horizontal plane (left) 
and the vertical plane (right). 


We use a geometric argument to explain the horizontal kick. For the vertical 
kick, we use a step function to model B,, and use Maxwell’s equation to derive 
B, which affects the vertical motion. 

We have a homogeneous bending magnet with the entrance edge line tilted 
by the edge angle a as shown in the left picture of Fig. 4.2. Using the standard 
step function H, also often referred to as the Heaviside function, the vertical 
field component B, can be expressed as 


By(z,y, 8) = ByoH (s — z tana), 


where Byo is the constant field of the main part of the dipole. The Heaviside 
step function H is related to the Dirac delta function 6 as 


PA 2a Tx 1 forz»0 
uoy- | sys ED for z <0’ 


+oo forr —0 = 
d(x) = { 0 fore £0’ O(a) =1 foranye>0. (4.5) 


=e, 

The B, expressed above is of course an idealized situation. In reality there 
is no magnetic field that can fulfill the above expression while satisfying 
Maxwell’s equations. 

Now consider a particle approaching to the entrance of the magnet paral- 
lel to the reference orbit, but positioned at x. As seen in the left picture of 
Fig. 4.3, the entering of this particle is delayed by the distance rtana. In 
the meantime, the reference particle travels through the magnet for this much 
of arc length, experiencing a deflection angle amounting to 6 = xtana/Ro. 
When observing the situation in the particle optical coordinates that are at- 
tached to the reference particle’s motion, the particle of interest located at 
the position x appears to have experienced a change in the direction of mo- 
tion by +z tana/Ro. Note that the picture in Fig. 4.3 is drawn exaggerated 
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to emphasize the relevant points. The same explanation applies to the situ- 
ation at the exit, where the change in the traveling direction appears to be 
+z tan o Ro. Thus, the formula for a; in eq. (4.4) has the +z; tana/ Ro term 
for both the entrance edge and the exit edge. 

Next we consider the equation for b’ in the set of equations of motion (3.22), 
and observe that the B, term is the leading term to affect the vertical motion. 
As seen in Fig. 4.1, there exists a non-vertical field component around the 
edges of the magnet off the midplane, i.e., when y Æ 0. Also, see the right 
picture in Fig. 4.3. From Maxwell's equations (3.2), we have the relation 


ðB, OB, 


Oy | Ox’ 


and using this, we can derive B, as 


OB ð 
B,(r,y,s)-— jJ -dy =| — ByoH(s — xtana)| dy 
Ox Ox 
= —B,o tanad(s — x tano): y. 
Thus, the equation for b’ of (3.22) becomes 
B; By 


y = 
Xm0 Xm0 


tanad(s — z tan a) y, 


and in the impulsive or kick approximation we obtain 


tana 
i 


B 
bp = bi — = tana f a(s — z tan a)ds : yi = bi — 
Xmo 
Now, at the exit side, having the opposite sign for the step function, By is 
expressed as 
B,(2,y, 8) = ByoH(—s — z tana), 


resulting in 
B,(z,y,s) = —Byotanaó(—s — xtana)- y. 


And, we obtain the same result as for the entrance case, namely 


B t 
bp = bi — yo tana f 6(—s—atana)ds- y = bi- Py, 
XmO0 Ro 


We note that the rise of B, is caused by the mere tilting of the edge line; 
thus a longitudinal component B, also exists, and it can be derived in a similar 
fashion. But the B, dependent term in the b’ equation of (3.22) also depends 
on a, turning it to be a nonlinear term; thus we do not consider it for this 
linear kick approximation. Another note is that in the homogeneous sector 
dipole magnet, B, does not exist because a = 0. 

Since the rectangular dipole is rather commonly used, it is worthwhile to 
calculate the total transfer matrix of a rectangular dipole by combining the 
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edge focusing and the main part of the dipole using eq. (4.3). In this case, 
the edge angles of the entrance and the exit are the same, namely half of the 
bending angle and thus 9/2. We study the z-a-[-ó block and the y-b block 
separately. Below, the abbreviation T = tan($/2)/ Ro is used. 

First, we calculate the combination of the main part and one edge for the 
z-a-l-ó block: 


cosó Rosing 0 (2|d)ai 1000 

yoga" sinó/Ro | cosó 0 (ald)ai T 100 
E Š (I|z)ai (1|a)ai 1 (1|ó)ai 0 0 1 0 

0 0 0 1 0001 


cos¢+TRosing Rosing 
—sinó/Ro--Tcosoó coso 
Qx)at+TUlaja — (I]a)ai 

0 0 


o.oo 
= 
EN 
œ 
a) 
g 


where the (1,1) and the (2, 1) components of the matrix multiplication are 
simplified to 1 and —T' as follows. 


29 29 $ $9 


cos o + T Ro sin @ = cos 3^ sin 3 Tian : 2sin 5 cos 5 = 
E B .Ọ Ọ Ó 2 2 P| |— 
—sind/Ro + T cos ó = -T [asin $ cos 7 tan 3-985 - sin ap —T. 
So, we obtain the matrix of the x block as 
1 00 0 1 Rosing 0 (a|d)ai 
Xr, = M Ne res — T100 -T cosó 0 (al|ó)ai 
0.0 1 0 (Ia) ai + T (la)ai (lla) ai 1 (1|ó)ai 
000 1 0 0 0 1 
1 Ro sind 0 (x|ô)ai 
z 0 1 0 T(z|ó)ai + (a|9)ai 
T(la)ai + (læ)ai (aja 1 (1ó)ai 
0 0 0 1 


where the same simplification happened for the (2, 2) component, and 


1+ 
(z|ô)ai = T ui (1 — cos), 

No ld 
(l|ó)ai = — Ro E (zz) ie ; 
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and 
T (x|ó)ai + (a|ó)ai = — [T (L|a)ai + (1|x)ai] 


1 $ 1- mo lcm" . 
Ec pM R«(1— 
: an g3 "m o (1 — cos à) + Dew sing 
1+ 70 Q 2 1+ mo Q 
= >(1— + = 2tan =. 
Ts jan 7 (1 — cos 9) + sin uum an 5 


The matrix of the y-b block is 


P 1 0\ {1 Rod M 1— TRoó Roó 
eg qu 1 -T 1} \-T(2-TRo¢) t= Thee} 


where 
1 
1- TR) = 1- ótan §, -TQ - Rod) = -7z tan $ f2- otan $] ] 
In summary, we obtain 
cf 1 Rosing 0 0 0 (x|ð) Ti 
af 0 1 0 0 0 (ald) ai 
yp | | 0 0 1-ótan(9/2) Roe 0 0 Yi 
by | | 0 0 (bly) 1—¢tan(¢/2) 0 0 bi |? 
n (v) (lla) 0 0 1 (id) || li 
OF 0 0 0 0 0 1 ĉi 
where the abbreviated matrix elements are 
AM. Q $ 
(bly) = CS 7 2- ọtan 1 ; 
E _ 1+ i - — lc $ 
(z|) = —(!|a) = pue —cos¢), (ald) = —-(Ilz) = a oe 


(l8) = —Ro 


15V. 
m ó — zn sinó|. 
2 + No 2 + no 
Note that the determinant of the matrix is unity, which also can be deduced 
from that the determinant of all the contributing matrices is unity. 
To conclude, let us compare the characteristic effects of the rectangular 


dipole and the sector dipole in the limit of small deflection angle ¢. The x-a 
matrix and the y-b matrix of both dipoles can then be approximated as 


i : y 1 Roe i 1 Roo 
Sector dipole: Mza > [om : ) , My (4 es 
Rectangular dipole: Mya > F L3 , My er 2 


Thus we have the interesting effect that the characteristic behavior in the 
horizontal plane and the vertical plane is exchanged. 
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weaker field 


FIGURE 4.4: An inhomogeneous sector magnet. 


4.3.8 The Inhomogeneous Sector Magnet 


In the case of an inhomogeneous sector, there is a magnetic field that is 
constant in s-direction, but not constant in the x-direction; rather it has the 


shape 
x 


From the recursion relations for the fields, we infer that the corresponding 

horizontal field is 7 
By = —Byo "AS 

ny in eqs. (4.2) is na = —n/ Ro, and h = 1/ Ro. In general terms, such a field 

is obtained by changing the distance between what generates the fields (coils 

or iron) as a function of z, similar to what is shown in Fig. 4.4 for the case 

of n » 0. 

We have the linearized equations of motion as 


B 1 1+ 
xv’ =a, d=- àr- Ze ean Tn E E adh 29 d 


Xmo Ro 2+ no 2+ 70 
B 
y-2b V=- y = hm, 
Xmo Ro 
1+ 1 
U = -h Ly 55, 4-0 
2+ no (2 + no) 


We observe that the horizontal motion is similar to the case of the homoge- 
neous sector dipole, except that the strength of focusing now also depends 
on n, the field inhomogeneity. Different from the homogeneous sector dipole, 
there is now an effect in the vertical direction, which can be either focusing 
or defocusing, depending on the sign of n. 

The solution of these equations of motion proceeds in the same way as 
before, first solve the homogeneous system, then address the inhomogeneity 
arising from 6 via variation of parameters, and finally solve for | by a mere 
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integration. In horizontal and vertical directions, the homogeneous solution 
corresponds to harmonic oscillators with frequencies 


wy =hV1l—n, wy = hyn. 


For 0 < n < 1, the magnet is focusing in both planes. An interesting case 
occurs for n = 1/2, in which case the magnet focuses x and y identically and 
represents a nice equivalent of the glass lens. 

The remainder of the derivation is tedious algebra, and we will only list the 
result here. 


vr (zjx) (xla) 0 0 0 (ad) Ti 
af (alz) (ala) 0 0 0 (ald) ai 
yp | _ 0 o (ly) (yb 9 0 vl (46) 
bp | 0 0 (by) (bb 0 0 A K 
I; (lx) (lla) 0 1 (qu [ih 
y 0 0 0 0 e 1 5; 
where 
(z|x) = (ala) = cos ( 1- nó) ; 
R vl-n 
(xja) = Vi = = sin (v 1— nó) ; (ala) = — Ro sin (v 1— nó) s 


(yy) = E = cos (vng) , 


(ub) = sin (Vno) (oy) = - X sin (vo), 
(2|6) = - (lla) = z — " E [1 — cos (V1 = n$)] , 


(a|ó) = —(l|v) = ; - am MIS sin (V1— n4), 


ee yy Eod MK Mm 
UE (m) ds (L+)? i 


and the determinant is unity. 


aca sin (V1—n$) | 


4.3.4 The Inhomogeneous Electric Deflector 


Rather commonly known is the motion of a particle in an electric capacitor. 
Neglecting fringe fields, it follows a parabola as shown in Fig. 4.5. For particle 
optical purposes, such an arrangement is not particularly suitable for two 
reasons. Firstly, the reference orbit has a curvature that depends on s, which 
makes the differential equations non-autonomous. Secondly, the potential 
along the reference orbit changes with s, which complicates the dynamics. 
Both of these problems do not appear if instead of a straight capacitor, one 
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parabola 


FIGURE 4.5: An electric capacitor consisting of two parallel plates. The 
orbit of a particle is parabolic. 


Ro 


FIGURE 4.6: A concentric electric deflector. 


chooses a curved one in such a way that the reference orbit is concentric 
together with the plates as shown in Fig. 4.6. 


To obtain the linearized equations of motion for such a device, we first 
observe that 


while B = 0, and using eqs. (3.16), 


Povo "Mo (2 + No) To (2 + Mo) _ mc? mo (2 + no) 
me no (2 + o) —— ——_.. 


RO Ges Ze 1+ no Ze 1+%0 


We describe the linearized electric field using the field inhomogenuity n, sim- 
ilar to the magnetic case, as 


hence ne in eqs. (4.2) is given by ne = —n/ Ro, and h = 1/ Ro. The linearized 
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equations of motion are 


1 1 1+ 
xt =a, d =—h? |2—n+—5|xe+h]1+ dd 
(1-F mo) (1 + 70) 2+ no 
y =b, =A? (1—n)y, 
1 
Qu DO pe | ghee og esp) 
2+ no (1 + mo) (2 +70) 


Since the electrostatic deflectors are used primarily for low energy electrons 
and ions (usually below 100 keV) due to the difficulty of achieving high static 
voltages, the particles are non-relativistic, i.e., ro < 1. As a result, the equa- 
tions of motion can be simplified to a more familiar form 


prp a! = —h? (3 — n) x + hô, 
1 
y — b, y — —h? (n— 1)g, U=—he 46, ð — Q. 


We observe that for 1 « n « 3, both x and y planes are focusing, differ- 
ent from the case of quadrupoles where always one plane defocuses; but the 
amount of focusing in the x and y planes is different. Indeed, similar to the 
inhomogeneous dipole magnet, the transfer matrix is 


LF (z|z) (zla) 0 0 0 (ad) Ly 
af (alx) (ala) 0 0 0 (ald) ai 
us| 0 0 (wy) (yb 0 0 yi 
br 0 0 (by) (bb 0 0 b; |? 
ly (|y) (|a | 0 0 1 (Uð) l; 
à, 0 0 0 0 0 1 5; 
where 
(a|x) = ius = cos (V3 — nọ), 
3— 
(xja) = A sin (V3 — no), (a|z) = m sin (V3 = nd) 
(yly) = (blb) = cos (Vn — 19) , 
(y|b) = -= sin ( n— 19) s (bly) = — E sin (vn — 19) 
(219) = —(la) = = [1 — cos (V8 —n4)]. 
(aléy esie: A— siti (394), 
1 1 1 . 
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1 I 


FIGURE 4.7: An electric deflector with cylindrical plates. 


ELEN 
lf | 


FIGURE 4.8: An electric deflector with spherical plates. 


and the determinant is unity. 

So far, no assumptions have been made about the vertical shapes of the 
electrodes, and in fact a variety of choices exist. Two common situations are 
the cylindrical plates and the spherical plates, as shown in Fig. 4.7 and Fig. 
4.8, respectively. 

In the case of the cylindrical field, we have from Gauss’ law that E œx 1/R, 
which implies the expansion 


Ez = X LM = — Ero 1— = > 
Ro +2 R 


and hence corresponds to n = 1. The transfer matrix is 


rf cos(V/2¢) (Ro/v2)sin(V29) 0 0 0 (a|d)\ (a; 

af —(V2/ Ro) sin(v2¢) cos(v/29) 0 0 0 (ajó) | | a; 

yl 0 0 1 Rod 0 0 Vi 

by 0 0 0 0 0 bi |? 

ly (Ix) (lla) 0 0 1 (jfk 

Of 0 0 D. 0 0 1 ô; 
where 


(z|8) = —(lja) = = È — cos (v20) > S (v20) l 


/2 
(8) = -Ro E x A sin (vss) 
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For à = 1/2 = 127.28, the deflector is imaging in the horizontal plane, 


which has been used as energy spectrometers. 
In the spherical case, we have E œ 1/R? and thus 


Ro : x 
° fer) > ae ( is) 


and n = 2. The transfer matrix is 


vf cosó Rosing 0 0 0 (a|d) Xj 
ar —sinó/Ro cosó 0 0 0 sing ai 
ys 0 0 cos@ Rosing 0 0 Vi 
by 0 0  —sinó/Ro cosd 0 0 b; 
l — sinó lla 0 0 1 1|d) li 
f 
ó, 0 0 0 0o 0 1 ó; 
where 


ai= (pes (Fo sind) | 


When ¢ = m = 180°, this deflector forms a simultaneous image in both plane, 
also known as a stigmatic image. It has been and still is widely used as 
an energy spectrometer. It is also called a hemispherical analyzer, which is 
the main workhorse in the field of angle-resolved photoemission spectroscopy 
(ARPES). 


4.4 Round Lenses 


We now address the important class of so-called round lenses, which owe 
their name to their rotational symmetry along the beam axis. Electric round 
lenses are usually made of arrangements of rotationally symmetric metallic 
plates or tubes concentric with the reference orbit, each of which is held at 
a certain potential; and magnetic round lenses are usually made of solenoids 
carrying current and concentric with the reference orbit. 

The rotational symmetry apparently entails that any fields in x and y di- 
rections are equal, and Maxwell’s equations immediately show that this can 
only happen if there are also fields in the direction of the axis, which requires 
that the potential changes along the reference axis. Specifically, the potential 
for the rotationally symmetric case is described in eq. (3.6), and is given by 


V =a Vo (8) - EVG (s)? =2 Vo (8) - EVS (e) (2? +1?) 
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up to second order in r. Therefore, the electric field, linear in x and y, is given 
by 


1 1 
Ey =1 gu (s)r Ey =1 243 (s)y, Es =1 —Vo(s), 
where E, and E, can also be expressed in terms of E, as 


1 1 
Ez =1 ~ 52s (s)x, Ey =1 ~ 52s (s) y. 


Fully analogously, we obtain in the magnetic case that 
1 1 
By =1 ao (s) x, B, al zo (s) v, Bs =1 -V (s), 


where B, and B, can also be expressed in terms of D, as 


1 1 
B,24 -3B (s)z, By=1 — 38: (s) y. 


We note that the expansion order listed in the subscript of the equal sign 
here in this section only applies to the variables r, x, y, a, b and 6, while 
the dependence on s is explicitly retained and not expanded. Fig. 3.3 shows 
typical resulting electric and magnetic scalar potentials as well as the resulting 
radial and axial fields for such cases. 

Before embarking on detailed studies of the dynamics in electrostatic and 
magnetic round lenses, we illustrate one of their important characteristics. We 
determine the average radial electric or magnetic field along a straight line a 
fixed distance r away from the center. We perform the averaging from —S to 
S where S is chosen large enough that all fields vanish at +S. We obtain 


S S 
1 E,ds = ri B,ds 
-5 -s 


5 1 
z a SVO (s) rds = 5r [Va (9) - Va(—S)] = 0. (4.7) 


So all radial field components average out to zero. This is in stark 
contrast to for example the electric and magnetic quadrupoles or the combined 
function bending magnets, where the integrand of the radial field is constant 
throughout the integration. 

Consider now the case of a thin round lens in which a particle does not 
change position much, similar to the situation in the idealized thin lens. In 
this case the average in the above field integrals will be responsible for the 
directional offset the particle experiences, and so this offset is zero. Thus any 
focusing action the particle may experience must come through secondary 
effects and is the result of a then incomplete cancellation of radial field con- 
tributions. This situation is sometimes referred to as weak focusing. These 
effects will be studied in detail below for both the electrostatic and magnetic 
cases. 
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Since the case of rotationally symmetric s-dependent potential is not cov- 
ered by the assumption used for the linearized equations of motion (4.2) de- 
rived in the beginning of this chapter, we go back to the general equations of 
motion (3.22) in Chapter 3. In this case, h = 0, and hence the set of equations 
is given by 


1 _ Po jx. 1+ Ex po B, Bs po 
Tr = — a, a= Sp OD, 
Ds 1+ No Xeo Ps Xmo Xm0 Ps 
yem quu dX ES Dg Pa Hs 
Ds : 1 + no Xeo Ds XmO0 Xm0 Ps à 
(-(nRBA)I à 20 (4.8) 
1+ ops vo” í 
where 
—1/2 
ZeV 2 
0 =n +8) - ——., 2 (2H +) mer) 2 1 
mc Ps — lo (2+ m0) 
i se Ep _ Po _ Povo 
vo 2+ 10° Mme Gee AE Ge 


In the following subsections, we study the two important regularly employed 
classes of lenses, the magnetic round lens and the electric round lens. As we 
will see, different from the cases of the electric and magnetic quadrupoles, 
their focusing properties arise from different mechanisms, and the magnetic 
and electric round lenses require a quite different treatment. 


4.4.1 The Electrostatic Round Lens 


Electrostatic round lenses are arrangements of metallic electrodes with ro- 
tational symmetry that act as equipotential surfaces and thus determine the 
on-axis potential. Electrostatic round lenses are frequently used in electron 
microscopes and also in the shaping of low-energy beams near the source. For 
a rotationally symmetric electrostatic lens, we have the electric field, linear in 
x and y, given as 


E, = zVo (s) x, E,—zVy (sw E.— =V (s); 
and the corresponding electrostatic potential 


V=Vols)— 5 o (s) (2? rv). 
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Applying these to eqs. (4.8), we have 


1 1 1 
g = Po ds d = +n Bo Syy (s) a, 
Ds 1 + no Xeo Ps 2 
1 Po 1 1+7 1 pol " 
= —), b= —-W(s)y, 
á Ps l- no Xeo ps 2 (s) 
1 
tel Hia jE, & =0. (4.9) 
1+ no Ds Vo 


The z-a part and the y-b part are decoupled, and they have the same form, 
so we only need to study one of them. 

For the further study, it is convenient to express the a’ equation above 
in various ways. Using the relation p/v = m(1+7) from eq. (3.16) and 
V/v = p/p because of d || p; we can write 


1 1 1 1 
at = T Do y” (s) x vo 


= Ic nb Xeo Ps 2 0 vi (s) a. (4.10) 


7 2X0 Us ? 


We note that in linearization, vs, the s component of v, equals the velocity of 
the reference particle. Furthermore we observe that the focusing effect rests 
on the assumption that traveling through the potential leads to appreciable 
changes in velocity, which, for practically achievable voltages, limits the use of 
the effect to the near non-relativistic regime. So in the following we perform 
our argument in the non-relativistic limit and have 


vo = V 2Ko/m E VKo 
Us 2K(s)/m  \/Ko — ZeVo(s) 


Here K(s) denotes the momentary kinetic energy of the reference particle, 
which of course changes as a function of position in the lens due to the change 
of Vo(s), while Ko is the constant kinetic energy of the reference particle before 
the round lens. 

The above differential equations are all that is needed to determine the 
transfer matrix for the linearized motion of an electrostatic round lens with 
a certain potential distribution Vo(s). However, in the following, we try to 
obtain a better understanding of the situation, and in particular we show the 
reasons why these lenses are generally focusing. 


4.4.1.4 Hard Edge Fringe Fields 


As a first step towards understanding the behavior of electrostatic round 
lenses, we discuss the fringe field effects appearing in an abrupt transition into 
a region of axial field from a field-free region. In practice this situation arises 
when transitioning through a hole of small aperture in a large charged metal 
plate. When passing through the plate leading to the idealized hard edge at 
s = 0, E, and E, are expressed in terms of the Heaviside step function H and 
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the Dirac delta function ó defined in eqs. (4.5) via 
Es(s) = EoH(s) Ej(s)-— Eoó(s), 
which in terms of potentials corresponds to 
Vols) = V — sEoH(s), Vo(s) = —EoH(s), Vo (s) = —Eoó(s), 


where we assume that the transition happens at an initial potential V, and we 
use the impulsive kick approximation in a similar manner as applied to the 
dipole edge focusing in Section 4.3.2. The position x is unaffected over the 
infinitely short transition at s — 0, and we obtain 


* vK PEL MI 
em 0— \/ Ko — ZeVo(s 


is n vko CRF HN v Ko 
'2xeo VKo — ZeV SEM Ko — Zev” 


where we use the abbreviation 


af = 


= ay — 


Eo 
2Xe0 : 


In an analogous way we can treat the transition from a region with field Eo 
into a field free region, and obtain 


E.(s)— EoH(-s), | Ei(s)-— —Enó(s), 
where the exit is at s = 0, so that we obtain 


EC EV a pe A CR 
2xeo V// Ko — ZeV vU A Rymy 


To summarize, the 2 x 2 transfer matrices for (x,a) at the entrance and the 
exit are given as 


af — ài +t Ti 


sm -( XJ zai) s -( 1 A 
~ (ay Ko/(Ko- ZeV) 1 J’ ~ \-ayKo/(Ko—- ZeV) 1 J’ 
(4.11) 

where a = —Eo/(2Xeo) is used. We emphasize that V is the momentary value 
of the potential, and V appearing in Min may differ from that in Mont, 

Overall we have kick effects that, similar to other fringe field cases, leave 
the position unaffected, but change the directions by an amount proportional 
to position. For a particle of positive charge, stepping into a positive field 
from a field free region leads to a focusing effect. Changing the sign of the 
field or stepping out of a positive field leads to a defocusing effect. 
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4.4.1.2 The Mechanism of Focusing 


We now consider in a very general manner the situation of a weak or short 
electrostatic lens. We observe that in eq. (4.7), one important effect has been 
neglected, namely the fact that the particle necessarily gains and loses energy 
as it travels through the lens. As a matter of fact, any transverse field bends a 
particle more readily when it is applied where the particle has lower velocity. 

Consider a weak lens that consists of a potential that in the region from 
—S to +S is first constant, then rises, then plateaus, and then falls off to its 
original constant value, similar to the case shown on the right in Fig. 3.3. 
Necessarily the regions of positive second derivative appear near the minima 
of the potential function, while the regions of negative second derivative cor- 
respond to maxima of the potential function. So while traveling through the 
lens, the particle is more sensitive to the focusing fields than to the defocusing 
fields; and even though the focusing and defocusing portions cancel, the net 
effect is that the orbit experiences focusing. To observe this quantita- 
tively, we transform the integral in eq. (4.10) via integration by parts, and 
obtain 


Xi v Ko 


Ti 7 v KoZeVg3(s) 
2Xe0 J/ Ko — ZeVo(s) 


S 
Vi) ; Wi (5) ds. 


_g *Xe0J—s [Ko — ZeVo(s)] 


The first term vanishes, and using Xeo = povo/(Ze), which non-relativistically 


reduces to 
|. 2Ko 


e0 = ; 4.12 
Xe0 Ze ( ) 


the expression takes the form 


S / 2 
af = ai — 2s f a | ES ds. 
—S Ko = ZeVo(s) 2X0 


Since the integrand involves only non-negative terms, the integral itself 
is non-negative; and if there is any place where the potential on axis Vo(s) 
actually changes, then the integral is actually positive. Thus the change of a 
is negative, corresponding to focusing. 

As we have seen, other than restricting ourselves to a thin or weak lens, 
this result has been obtained by mere manipulations of the integral without 
any assumption of the specific form of Vo(s) except for its constancy at +. 
This is actually a small example of various similar manipulations that have 
been developed in the past; for example one can show the focusing property 
even in the extended case [60, 67]. More impressively, conceptually similar but 
practically more involved arguments can be used to prove Scherzer’s theorems 
[62] about signs of higher order aberrations. 

The resulting formula can be used to obtain an approximation of the focal 
length of a thin or weak electrostatic lens. However, other coordinates than 


Neo 


The Linearization of the Equations of Motion 93 


our z and a are used frequently in the electron optics community; in particular, 
the coordinate x is scaled to 1 by a factor depending on the kinetic energy of 
the particle. This leads to less change in the value of the position coordinate 
Z as we travel through a lens. This nonlinear transformation leads to the 
approximation of z=constant being better than that of z—constant, which in 
turn leads to better estimates for focal length. There is a large amount of 
work on this topic, but we forgo the details and refer to some of the literature 
[56, 60, 67]. 


4.4.1.3 The Plate Lens 


We now address a particular type of lens for which it is possible to derive 
the transfer matrix analytically. We consider combinations of individual plates 
placed perpendicular to the optical axis, each of which has a small hole in its 
center through which the beam travels. Each of these plates represents a hard 
edge fringe field as discussed above. 

The plates are held at different potentials and form equipotential surfaces. 
Assuming that the plates extend to infinity, the electric field between two 
successive plates is that of a common plate capacitor, and it points in the 
direction of the reference axis. 

For any electrostatic lens it is desirable that the field far away vanishes. 
Compared to the case of the hard edge fringe field, this requires the use of at 
least two plates. Assuming the field between the plates to be Eg and their 
distance to be S, i.e., 


Eo for0O<s<S$ 
THAM { 0 fors<0,s>S8’ etd) 
the potential at the second plate is V = —S Eo, and we have the potential 
function 
0 for s< 0 
Vols) 24 —-sEo for0<s<S. 
—SEg fors>S 


The effects at s = 0 and s = S are merely those of hard edge fringe fields as 
discussed before. It is worthwhile to point out that if V > 0, which entails 
that Eg < 0, a particle of positive charge has lower energy at the second plate, 
and thus is more susceptible to the transverse electric fields there. From the 
discussion of hard edge fringe fields, we know that the transverse fields in 
both places are of opposite sign but identical magnitude, thus the second 
field, which happens to lead to focusing, has a more significant effect. 

Using eqs. (4.11), we have the transfer matrices for the kicks at s — 0 and 
s — S as follows. At s — 0, we have 


Vo(0) 20, K(0)= Ko,  ps(0) = po = v 2mKo, (4.14) 
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and at s = S, 
Vo(S) 2 —SEo, K(S) = Ko + ZeSEo, 
ps(S) = ps = y 2m (Ko + ZeS Eo). 


Thus, we obtain the transfer matrices at s = 0 and s = S as 


My = ( E i ) um es 1 ) è m) 


where again the abbreviation 


Eo 
2Xe0 


is used. 

In order to study the two-plate lens quantitatively, we need to still determine 
the transfer matrix of the space between the plates. In this region, a charged 
particle experiences uniform acceleration or deceleration without change in 
transverse momentum, i.e., a; = aj. However, the position changes in a similar 
mechanism to that of a drift, but including the effect of the change of the 
kinetic energy. For the region between the electrodes, we have from the x’ 


equation of (4.9), and using ps.(s) = \/2m (Ko — ZeVo(s)), 


S S 
VK 
As-s;-s-a] Peis =a; f oa eg 
0 ps(s) 0 V Ko + ZesEo 


E a (VKo + ZeSE; — VKo) a 2 po (ps — po) 


Zeko ' ZeEo 2m 


where the following relation is used to simplify the last step, 


p3 — pi = 2m(K(S) — Ko) = 2mZeS Eo, (4.16) 
and the abbreviation 9 
Lg ‘PO 
Ds + Po 
is used. Thus the transfer matrix of the gap between the plates is 
^ _( 1 S-2po/(pst+po) V. f 1 Ls 
M; = ( 0 1 - ws oq ds (4.17) 


We note that if there is no change in the kinetic energy and thus ps — po, this 
matrix agrees with the matrix of a drift with length S. On the other hand, if 
the system is accelerating, we have Lg « S, while for a decelerating system, 
Lg > S, reflecting the behavior expected from a simple geometric analysis. 
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Combining the above matrices, we construct the transfer matrix of the 
electrostatic round lens with two plates as the field described in eq. (4.13). 


WM E e 3r me 1) 


E ( 1+aLs Ls ) 
~ \al[l—(po/ps)(1+eLs)] 1—oLs(po/ps) / ` 


This is a rather compact representation of the matrix. However, to more 
clearly observe the influence of the defining parameters V and S, we now also 
express the matrix in terms of these quantities. We note that o Ls can be 
expressed in terms of momenta using eqs. (4.16), (4.12) and (4.14), and we 
obtain 

Eo 2po 1 Ze pg- po 2p Po-pPs 


alg = — -S eu uir E ee MM TI 
9 ^ 9xe  ps-^po  22Ko 2mZe ps+po  2po ee) 


Using this, the elements of M can be organized as 


3po — ps 
z\x) = 1 + aLs = ———, 
(z|) I 
= ae 
lua micum qq e c PME 
Ps 2ps 2ps 
3p = 
ce h _ Poy + aks) a (: B si] _ ag P Ps 
ps ps  2po 2ps 
= -302 Z Lg 
ps 
resulting in 
P (3po — ps)/ (2po) Ls ) 
M= ; 4.19 
( —3o?Ls(po/ps) (3ps — po)/(2ps) ety) 


In this representation, it is obvious that the (a|z) element is always negative, 
and thus we obtain the important conclusion that the two-plate lens always 
focuses. 

As observed above, the simplest plate lens leading to vanishing electric fields 
at far distance required two plates. However, the potential after the two-plate 
lens differs from the potential before. Thus, in order to achieve identical 
potential before and after the lens, which is often desirable in practice, at 
least three plates are necessary. So we consider a lens that consists of three 
flat electrodes as shown in Fig. 4.9, where the axial electric field E;(s) and 
the potential Vo(s) are given as 


Ey for -S<s<0 
E.(s) = 4 —Eg for O0<s<S, 
0 for ls| >S 
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Vo(s), 
BE V M 
25 0 S S 


FIGURE 4.9: Layout (top) and potential profile (bottom) of the electro- 
static three-plate round lens. 


and 
—sEo +V for -S<s<0 
Vo(s) = sEg-V for O<s<S, 
0 for ls| > S$ 


and the potential V at the middle plate is given by 
V=—-SEp. 


We observe that we can treat this system as a combination of two of the two- 
plate electrostatic round lenses already discussed. For this purpose, we first 
list the momenta at the plates. At s = +5, we have 


Vo(£8)=0, K(XS)- Ko, ps(+8) = po = V2mKo, 


and at s = 0, we have 


YO=LVE4SH, K(0)= Ko + ZeSEp, 
ps(0) = Pm = V 2m (Ko + ZeS Eo). 


For purpose of clarification we note that if V > 0 as shown in Fig. 4.9, we 
have that Eo < 0 and pm < po. 

The first half of this lens is simply the two-plate lens discussed above. Thus 
from eq. (4.19), the transfer matrix for the left half of the system is 


+ _ ( Bpo — Pm)/(2po) Ls 
x ( —3o?Ls(po/pm) (3pm ies) 
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where 
SAL. Pee ee 
2Xe0 Pm + po 

The second half consists of the entrance kick with the momentary momen- 
tum pm, the gap of length S connecting the two momentum states pm and 
po, and the exit kick with the momentary momentum po, with the constant 
electric field — Ep in the gap. 'The transfer matrices of these components turn 
out to be Ms, M, and Mo from eqs. (4.15) and (4.17), respectively, where 
here we use Dm ea of ps. So we obtain the transfer matrix for the right 


half of the system as 


lis = X My: Ns = ( aie 2 ties 1) 


Q&Q = — 


= ( 1 — aL s(po/Pm) Ls ) 
«Xo [L — (po/pm)(1-- oLs)) 1+aLs 
ec — po)/ (2pm) Ls ) 
—3o?Ls(po/ps) (3po — Pm)/(2p0) / ` 


We observe much similarity between Mr and M n; the off-diagonal elements 
are the same, and the diagonal elements are merely switched. The (a|x) 
element of both Mz and Mg is always negative, thus both of the lens halves 
always focus. 

Finally, the calculation of the transfer matrix of the whole lens can be 
conducted, where the similarity of Mr and Mg helps the arithmetic in the 
process. We obtain 


Mr = Mr- ÑM, = (e (z|a)z 3 (t a|x)z ded 


(a|z)r (xlļx)z / N(alz)r (ala)r 
- [or a)r + (x|a)r(a|x)r 2(a|a) r (|a) r ) 
(z|x)r(a|z)r (z|x)r (ala) r + (x]a)r(alx)r 
- - I (3/2) (po m Pm)? / (popa) Ls(3pm = po)/Pm ) 
—3o? Ls(3po — Pm)/Pm 1 — (3/2)(po — Pm)? /(PoPm) 


where eq. (4.18) is used to simplify the result. 

We now consider the focusing property of the three-plate lens. As before, 
Ls > 0, but we now have the factor 3p9 — pm determining the eventual sign 
of (a|z). In case the magnitude of the voltage V is small compared to the 
kinetic energy of the particle, the center momentum pm is similar to po, and 
so 3po — Pm > 0. On the other hand, in cases of extreme voltage V leading to 
large acceleration and large center momentum pm, it is conceivable to achieve 
Pm > 3po. In this extreme case, the three-plate lens actually defocuses. 


4.4.2 The Magnetic Round Lens 


Magnetic round lenses are arrangements of wires wound equidistant to the 
reference orbit, either in long arrangements similar to textbook-like solenoids 
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with a large region of nearly constant axial fields, or in short or "thin" ar- 
rangements where the field on axis rises, reaches a peak, and then falls off 
again. Thin magnetic lenses are the main staple used for focusing in electron 
microscopes, while long magnetic lenses are used in various particle accelera- 
tors for guiding the beam, including applications in muon ionization cooling. 
We have the magnetic field, linear in x and y, given as 


1 1 
Mies j (s) a, BSG (s)y, B, = —V (s), 


which is derived from the corresponding magnetic scalar potential 


1 
V = Vo (8s) - 3 Vo (s) (x? +9"). 
Different from the electrostatic potential in the case of the electrostatic round 
lens, the magnetic potential does not enter the equations of motion directly. 
So we simplify notation by not having it appear in the equations of motion, 
and rather express the fields in terms of the axial center field B,(s) as 


1 1 
By, = —5Bs(s)2, By = ~5Bs(s)y, B, = B,(s). 
Applying this to eqs. (4.8), and linearizing in the similar process for eqs. 
(4.2), we obtain 


ve =a, a =+ b+=—*y, 
Xmo 2 Xmo 
D; 1 B 1 

jux Poa Gehe. cus ae desi aud 
Xmo 2 Xmo (2+) 


It is immediately apparent that the motion of the two planes are coupled, 
which will lead to interesting properties. The longitudinal motion to first 
order is the same with that of a drift, thus 
ly = Dô; + li = — SG + li, Of = ôi 

(2+ no)? 
with length L. 

To study the transversal motion, we first express the equations of motion 
(4.20) in vector notation. Using 


eG 0 Q3 


Jz. (4.21) 


we have 
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For a particle moving parallel to the s-axis without transversal momentum, 
i.e., c; = 0, the second term in the Z” equation, B. /(2Xmo): JZ, acts to produce 
a nonzero transversal momentum. Typically, the situation of B, Z 0 happens 
in the fringe field regions near the entrance and the exit, and it is the main 
reason why particles entering parallel to the s-axis will follow a transversely 
rotating motion inside the magnetic round lens. 

In the following, we will employ a dual approach in understanding the 
motion of particles in the magnetic round lens. The quantitative approach 
will be based on studying the equations of motion and solving them for certain 
specific cases. But equally important is the qualitative understanding of where 
the focusing effects of a magnetic round lens really come from. 


4.4.2.1 Hard Edge Fringe Fields 


We first discuss the fringe field effects appearing in an abrupt transition into 
a region of axial field from a field-free region. In practice this situation arises 
at the edge of a solenoid of very small radial aperture. We use the impulsive 
kick approximation in a similar manner as applied to the dipole edge focusing 
in Section 4.3.2. When entering into the solenoid of the constant field strength 
Bo at the idealized hard edge at s = 0, B, and DB, are expressed in terms of 
the Heaviside step function H and the Dirac delta function 6 defined in eqs. 
(4.5) via 

B;(s) = BoH(s), B! (s) = Boó(s), 


where the entrance lies at s = 0. Applying them to the equations of motion 
(4.21), we obtain 


0+ z z 

BoH ^ Bod A Bo > 

z= a+ | | SEO) ja 969) fyi ee a, 
0— Xmo 2Xmo 2Xmo 


while Z is unaffected. Altogether we obtain the transformation relations be- 
tween the initial conditions % and c; and the final conditions Z; and C; for 
such an idealized thin edge as 


Ze LE Cr =c+ SZ (4.22) 
At the exit the same effect happens, but with an opposite sign of BY, because 
we now have 


B,(3) = BoH(—8), B; (5) = —Bod(8) 
where the exit is at 5 = 0, so that we obtain 
Bo 3, 


Zh = eis Cf = G— JZi. 4.23 
f f TE (4.23) 


Overall we have kick effects that similar to other fringe field cases leave the 
position unaffected, but change the directions by an amount proportional to 
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FIGURE 4.10: Transversal motion of particles entering at positions x; 
initially parallel to the reference axis in the magnetic solenoid. After crossing 
the fringe fields, the particles execute rotations with radii r; equaling half of 
their entrance position. 


position. However, very different from the case of the electrostatic round lens, 
this change in direction in itself is neither focusing nor defocusing but rather 
happens in azimuthal direction, perpendicular to the radial direction. 


4.4.2.2 The Mechanism of Focusing 


Since the action of fringe fields in the transition into axial magnetic fields 
leads to azimuthal kicks, one is led to wonder where any focusing effects may 
arise from. We consider an incoming beam where all trajectories are paral- 
lel to the reference axis, i.e., c; — 0. Furthermore, because of the rotational 
symmetry, it is sufficient to limit our attention to a particle starting on the hor- 
izontal axis, which means Z; = Z; = (x, 0) in eqs. (4.22). After having moved 
through the fringe field, the particle has now picked up a velocity component 
in the negative y direction via @ = (Bo/2x,,0)J Z; = (Bo/2Xmo)(0, —2;). 

We assume that the particle travels for a while in the constant magnetic 
field Bo. In this field, it performs a rotation due to the nonzero transversal 
velocity picked up in the fringe field, which is v = (Bo/2x,,0)x;vo. The radius 
of this rotational motion is given by 


"mu, ym Bo 1 
r= = _——TZT = 
ZeBo ZeBo 2X mo 


where Xmo = po/Ze = ymvo/Ze is used. So the rotation radius is exactly half 
of the radial entrance position of the particle. As it turns out, the factor of 
1/2, which can be traced back to the fact that the radial derivative 0B, /0x = 
—1/2- B} (s) is only half of that in the axis direction, will be the key mechanism 
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to focusing. 

We illustrate the resulting motion in Fig. 4.10, which shows the resulting 
transversal orbits for six particles on both sides of the origin that were initially 
moving parallel to the axis of the solenoid without any transverse motion. 
Each of these particles performs a circular motion with a radius that equals 
half of its entrance position. In particular this entails that the particles do not 
have concentric orbits like in a cyclotron, but rather that after a half period 
of their oscillations, all these particles will pass through the exact center of 
the device. 

This observation now leads to focusing; on their path from their initial 
position z; to the position z — 0 after a half period, each particle loses distance 
to the axis and is thus focused. 

We further observe that all particles rotate with the same angular frequency 
given by 


This entails that all initially parallel particles reach the origin at the same 
time, and also that the relative loss of distance to the axis is the same. So the 
speed with which they move towards the origin is proportional to their initial 
position, and it is a hallmark of thin lens focusing. 

The picture suggests another interesting feature; initially axis-parallel par- 
ticles that are on a common line upon entering remain on a common line, 
shown dashed in Fig. 4.10, in their further motion through the region of 
constant magnetic field. Elementary geometry shows that this is actually the 
case: by connecting any particle in Fig. 4.10 (dot) and the corresponding 
center of its orbit (cross), we can see immediately that the angle between the 
dashed line and the z-axis is always half of that between the radius and the 
x-axis. We further observe that when the particles reach the origin and have 
performed a half revolution around their orbits, the dashed line will coincide 
with the vertical axis, and thus will have performed a quarter revolution; in 
fact elementary geometry shows that the dashed line performs a rotation with 
one half of the rotational frequency of particles. 


4.4.2.3 The Rotating Coordinate System 


'These observations of the idealized case now motivate the treatment of the 
general case, in which fields do not jump abruptly but rather gently depend 
on s. In this case the appearing momentary rotation frequencies of both the 
particles as well as a possible suitable rotating coordinate system are not 
constant but will change with the position s. Thus, we attempt the Ansatz of 
introducing new variables Z that describe the motion in a rotating coordinate 
system via : 

2(s) = R(0(s)) - Z(s), (4.24) 
and we will arrive at the expected result at eqs. (4.28). Here, R(0) is a 
rotation matrix, and for the further discussion, the following matrix properties 
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are useful: 


5-0) = R6) F=(_ SRG qug) = sind f+ cos J 
dR(0) i sinô cosO\ 5; dR(0) e 
(m i) = —J R(0), and Er ES JR(0). 


(4.25) 


Similar to the definitions of Z and Z, we denote the new rotating variables Z 
and Č as 


Note that this does not automatically entail c — Rud, Rather, c differs from 
R-C as we will see now. By computing the derivative of eq. (4.24), we express 
C(s) in terms of R, Z and C. 


&(s) = Z'(s) = OR Z(s) + R(0)Z' (s) = —9 JR(0)Z(s) + É(0)C(s). (4.26) 


In turn, this allows us to express C (s) in terms of Z and Z. 


C(s) = R-1(6)- [ats) + 6 FR(0)Z(s)| = R(—0)e(s) + &' JR(—0)z(s), 


where eqs. (4.24) and (4.25) are used. Next, we calculate the derivative of 
C(s), expressed in terms of Z and c. 


,dR(—0) 
ds 
- (oj - 9? i) R(—0)z(s) + 20' I R(—0)es) + R(—0)Z"(s), 


Z(s) + 0'JR(—0)Z'(s) 


where the relations (4.25) and c — Z' are used. 
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We now insert the equation of motion (4.21) for @’ above, and express C" (s) 
in terms of Z and C using eqs. (4.24) and (4.26). 


Ü'(s) = (ej = oei) R(—0)2(s) + 20 JR(—6)a(s) 


+ Ro Z jas) + 2s js) 


Xmo 2Xmo 
B [v i a i-e] RC-6)&(s) 4 c ! 2) RC Oye) 


B' ^ 5 B, jal] og 
= IG TT ) J — 0? ] +26 (v 4 Lm j Z(s) 
2X mo 2x mo 
B bees 
+a (oZ) JC(s), 
2 (s) 
where the relations (4.25) are used. 
The resulting equation of motion for C looks rather complicated, but a 
closer inspection shows the same factor appearing repeatedly. Indeed, if we 
demand B 


6’ + —— =0, 
2X mo 
which is equivalent to 
* Bs(8) ,. 
(s) —- — ds, (4.27) 
0 2Xmo 


the equation for C’(s) greatly simplifies to 
C' (s) = —0? Z(s). 


This means that the motions of the two coordinates of the vectors Z and 
C fully decouple, and we hence have the following simple set of first order 
differential equations to describe the motion of Z(s) : 


2 
Z'(s)=C(s), | C'(s--8?Z(s = — (28) Es). (4.28) 
2x mo 

So the motion in the rotating system is like a harmonic oscillator with 
varying strength, which is given by the square of the angular frequency 6’ 
of the rotation of the coordinate system, which itself is proportional to the 
longitudinal field B,(s). It is a quite remarkable and yet simple result that 
fully describes the linearized motion in the magnetic round lens. 

We note that, as a consequence, a short magnetic solenoid is focusing, 
and the focusing power is proportional to 0? o 1/x2., x 1/p2, whereas that 
of a magnetic quadrupole is proportional to 1/ymo « 1/po. Therefore the 
advantage of using magnetic quadrupoles compared to magnetic round lenses 
becomes more pronounced for beams of higher momentum. 
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4.4.2.4 The Solenoid with Hard Edge Fringe Fields 


We now study an idealized long solenoid with vanishing field outside and 
with a constant interior field of 


B.(s)=Bo, Br = By =0. 


We begin the discussion with the observation that the form of the equation 
of motion (4.28) entails that the quantities Z and C vary continuously even 
when passing through a hard edge fringe field. In fact, different from the 
situation for Z and c, there are no delta functions appearing which led to the 
discontinuities in eqs. (4.22) and (4.23). So in both the entrance and the exit 
of the hard edge fringe field, instead of eqs. (4.22) and (4.23), we simply have 


Z;-Z, Cp = Gy. (4.29) 
Assuming that the beginning edge of the solenoid is located at s = 0, we 
have that 
Bo , dð Bo 


0 = — = — = — t t $ 
(s) 2n s, ds Dos (constant) 


The angular frequency 0’ is now constant on the inside, and we use the ab- 


breviation 
Bo 
w= (constant). 
2X mo 


Thus we obtain a simple harmonic oscillator solution for Z. Using the initial 
conditions Zo and Co, we have 


7 3 1 > 
Z(s)= cos(ws)Zo + — sin(ws)Co, 
W 


> 


C(s) = —wsin(ws) Žo + cos(ws)Co. (4.30) 


Now this solution has to be expressed in terms of the original coordinates 7 
and c. 

We begin by observing that outside the beginning of the solenoid, we simply 
have 

Z-Z Q=. (4.31) 

Next, because of eqs. (4.29), even just after entering the solenoid, we have 
Z- =Z; C; = &. In the solenoid itself, the quantities Z and C change accord- 
ing to eqs. (4.30) until the end of the solenoid is reached, where they have 
the values Z(L) and C(L). When exiting the solenoid, Z and Č again remain 
unchanged because of eqs. (4.29). In the outside region, there is no field left, 
so in eqs. (4.24) and (4.26) we have 0' — 0, thus the transformation equations 
simplify to 


(L) = R(wL).Z(L, AL) = R(wL)-C(L). (4.32) 
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We now combine the various matrices and use y = wL. We obtain the result- 
ing 4 x 4 transfer matrix M9"t-int-i? of the solenoid of length L for (x,a, y, b), 
representing first entering into the solenoid, then passing through its interior, 
and then exiting out of the solenoid, as a combination of eqs. (4.31), (4.30) 
and (4.32): 


jpout-int-in -— R(y) : Myo(y) 


cos? o sinycosy/w —sinycosp —sin? y/w 
—w sin y cos p cos? y u sin? o — sin Y cos Y 
— E : 2 2 . , (4.33) 
sin y cos q sin^ p/w COS? (p sin y cos p/w 
—w sin? y singcosp —wsinycospy  cos*y 
where we used the 4 x 4 matrices for the harmonic oscillator solution 
cosy sing /w 0 0 
^ —wsiny cosy 0 0 
M = : 4.34 
uo() 0 0 cosy  sing/w |? ee) 
0 0 —wsiny cosy 
and for the rotation 
cosy 0 — siny 0 
A 0 cos 0 — sin 
R(y) = d f (4.35) 


sin Y 0 COS (p 0 
0 sin Y 0 COS (p 


We observe that these matrices commute, i.e., Myo- R — R- Muyo, which 
helps simplify the matrix arithmetic here and below. 

We note that the matrix M?"'-int-i? has unit determinant since both the 
harmonic oscillator solution matrix M. Ho and the subsequent rotation R do. 
It is also easy to show that the longitudinal angular momentum is conserved, 
because 


> 


Zp x & = Z(L) x AL) = (R(oL)- Z(L)) x (R(wL) - C(D)) = Z(L) x C(L) 


E 


= c eZ; + —sin 2) x (-w sin eZi + cos eG.) = Z; Xx Č; 
W 
=z 


Now we may wonder what happens if we study the motion not only from 
the field free regions before to the field free region after the solenoid. For 
this purpose, we first remind ourselves of the 4 x 4 transfer matrices of the 
entrance and the exit edges, which according to eqs. (4.22) and (4.23) are 


10 0 0 1000 
^in | 0 1 —w 0 Cot ee 0 1w 0 
ME 00 10’ MATS 0.0 10 
w 0 01 —w 0 0 1 
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We observe (M9**)-! = M™, and both of the matrices have determinant 1. 

We discussed the behavior of the edges in Section 4.4.2.2, and we can now 
determine the change in the longitudinal angular momentum across the edges. 
At the entrance edge, using eqs. (4.22), 


Zi x Cp = Zi x (G —wI%) -—Zx&-wZx(JZ) 


> > Ti i 3 = = = > E 
— Zi x&-e(s)«( Ba axa tu (et es) é=%x&+urré, 
4 Tti 


where e; is the longitudinal unit vector. Thus the amount of angular momen- 
tum generated is wr? at the entrance and —wr? at the exit. 

'The transfer matrices for various situations below can be obtained by com- 
bining A9wt-int-in. yyin, out and their inverse matrices. The transfer matrix 
of the interior part is obtained by removing the entrance and the exit matrices 
from jj [out-int-in as (feno . ji [out-int-in : (M-L, and we obtain 


1 singcosy/w 0  —sin?o/w 


wnt = 0 cos(2y) 0 —sin(2y) 
~ | 0 sin? y/w 1 sinycosy/w 
0 sin(2y) 0 cos(2y) 


The determinant is again 1. Interestingly, the longitudinal angular momentum 
is not conserved in general when going through the interior, and the amount 
of the change is w(r7 — 17). 

When the particles start inside the solenoid and then exit, which typically 
happens when the particles are “born” inside the solenoid, we obtain M9"t-int 
by removing the entrance matrix from J9"tintin, je, yjpout-int-in , ( yrin)-1. 
or equivalently adding the exit matrix to Mi, i.e., Mout. fint, as 


1 singcosy/w 0  —sin?o/w 
jonas 0 cos? y w —sinycosy 
i 0 sin?y/w 1 sinycosy/w 


—w sinecosp 0 cos? i 


which again has determinant 1. As discussed above, while M9"t-int-i» conserves 
the longitudinal angular momentum, Mi? does not, thus the longitudinal an- 
gular momentum changes in the process. 

Finally, when the particles enter into the solenoid and remain inside, we 
have Mi"*-im of the form 


cos? p sinycosy/w —sinycosy —sin? g/w 
gpsem 2-9 sin(2y) — cos(29) —wcos(2p)  —sin(2y) 
~ | singcosp sin? p/w cos? i sin y cos p/w 


w cos(2y) sin(2y) —w sin(2y) cos(2y) 


The Linearization of the Equations of Motion 107 


as a result of removing the exit matrix from Mo"tintin je, (]pout)-1 . 
Mour-int-in Again the determinant is 1, and the longitudinal angular momen- 
tum changes. 

We conclude our discussion with a more detailed comparison of the treat- 
ment of the solenoid with the results of the electrostatic round lenses. We 
note the similarity between the solenoid including entrance and exit fringe 
fields and the two-plate electrostatic lens. Indeed, the magnetic scalar poten- 
tial, expressed in terms of the angle 0 in eq. (4.27), has risen steadily inside 
the solenoid and is reaching a plateau outside of the solenoid, just as the 
electrostatic potential rose steadily between the plates of the two-plate lens. 

It is thus illuminating to study the case of two solenoids of equal length and 
opposite strength, which will lead to a vanishing magnetic potential change 
at the end of the second solenoid, and conceptually corresponds to the elec- 
trostatic three-plate lens. In that case, the fact that the potential returns to 
its original constant value entailed that the particle's energies are the same 
as before. Here by virtue of eq. (4.27), we obtain that the net rotation of the 
system is zero. 

To be quantitative, we can obtain the transfer matrix of this system by com- 
bining those of two opposite solenoids with hard edges. We remind ourselves 
that the solenoid transfer matrix in eq. (4.33) was obtained as the product 


Mi = R(y) : Muo(¢) 


of the matrices R(y) and Myo(y) defined in eqs. (4.35) and (4.34). So the 
transfer matrix of a solenoid of opposite field is simply given by 


Mz = R(-v): Muo(-¥). 


Now we can use the fact that R(y) and Myo(y) commute as noted above, 
and obtain for the combined matrix 


M = Mz: Mı = R(-¢)- Muo(-¢)- R(v): Maole) 
= R(-e)- R(y) - Mno(—) : Muo(e) = Muo(29), 
where we have used that R(y)~! = R(—y) and Mgo(—v) = Myo(y). So 
indeed any rotation is removed, the x and y motion are fully decoupled, and 


correspond to a simple harmonic oscillator that is equal to that of a solenoid 
of twice the original length. 


4.5 *Aberration Formulas 


In the previous sections we have discussed in detail the linearization of the 
motion in particle optical coordinates, and the resulting transfer matrices for 
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common particle optical elements. However, in general the motion is not 
linear, and in many situations it is important to take into account various 
contributions of the nonlinear effects. 

Unfortunately, as straightforward as the determination of linear transfer 
matrices is in many cases, the determination of nonlinear terms, especially 
those of higher order, becomes exceedingly more difficult using paper and 
pencil methods. In the following we will describe a general method that 
in principle allows the recursive determination of aberrations of higher and 
higher orders, but which in practice quickly succumbs to a rapid increase in 
complexity for higher orders. 

Let us assume we are given the system described by the following ordinary 
differential equation (ODE) 


ane 

=r =f (Ts 

EF fü). 

which satisfies fi (0,5) = 0. We perform a Taylor expansion of the right hand 
side. Because the system is origin preserving, the first contribution is linear, 
and altogether we have 


r= M(s):r- M. Nj, s), 


j=2 


S 


d 
ds 


where the Ñ j are polynomials of exact order j, the coefficients of which may 
depend on s. 
The first step in obtaining a perturbative solution of the system is a lin- 

earization as in the previous sections. We have 

d ^ 

—r= M(s): fr. 

TE (s) 
For this system, we determine a system of n independent solutions le (s), 
k — 1,...,n, that satisfy the initial condition 


Y A T 
(0) = (0,0,-.., 1 ,...0,0)7. 
kth 


We define the matrix 


and observe that the general solution of the linearized problem with initial 
condition 7; is then given by 


In practice, the determination of L may be possible in closed form, depend- 
ing on the structure of M, or may have to rely on numerical integration. For 
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the special case that M is piecewise constant, then for every such piece, 
one can try the ansatz lj = Up - exp(wxs), which leads to the condition 


(Uy exp(wis) = M -Ùk exp(twis), 


an eigenvector problem. If M has n distinct eigenvalues, we are done, and 
depending on whether w; is real or complex, the solutions can also be ex- 
pressed in terms of sin, cos or sinh, cosh. In case of multiple eigenvalues, 
often solutions of the form s - sin, etc., can be found. 

The next step consists of an expansion of f(s) in a Taylor polynomial 


where R; denotes a polynomial of exact order 7 in the initial conditions, the 
coefficients of which may depend on s. We insert this expansion into the ODE 
and obtain 


VALE (s, T;) 


= M(s) -L(s) i + M(s)- M, R5 (s, 7) + V, Q;s È, Rx), 
j-2 


j-2 


where Qy's s(j2 2) are polynomials of exact order j in 7, which result from 
inserting 7 into Ny s. This insertion leaves no linear or constant parts, which 
is due to the fact that the ODE is origin preserving. This will prove crucial 
later in the algorithm for the solution. 

We now sort the result by order. The linear part has the form 


ESTO )2 M (s) - L(s), (4.36) 


and the higher order parts, j > 2, assume the form 


d = ^ > a me es 
Rj G8) = Ms); G9) + Qs È, Ba), (4.37) 


where Q; contains only Hy with k < j. So for j = 2, 3,..., we obtain a 
triangular system of ODEs. It can be solved iteratively in an order-by- 
order manner, and then each of the differential equations for R; contains 
only lower order terms Hi that are already known. In this way, the ODEs 
decouple and become inhomogeneous. 

Initially at s = 0, we have the initial condition r(0) = r}, and 


L0)=f, R;(0,7)=0 for all j =2,3,.... 
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In order to solve the inhomogeneous eq. (4.37) of order j, we first deter- 
mine the homogeneous solution, and then perform a so-called variation of 
parameters. The homogeneous solution is exactly the same form as for the 
linearized part. To obtain the inhomogeneous solution, we make the ansatz 
R,(s) = L(s) - T(s). Then 


LR, : E VE aes» Lf). 


Using eq. (4.36), the first term in the right hand side is 


^ ^ > 


(29) T) = Xt) £G) F (6) = 116). (8). 


Thus, from eq. (4.37), we obtain 


X di m "E 
L(s) g^ (s) = Q;(s, L, Re), 
that is $ 
T()- [ i76, £. Raas, (4.38) 
0 


where the choice of the lower integration boundary as 0 ensures that T(0) 
= 0, which agrees with the initial condition R; (0) = 0. Altogether we have 


R; (s) = L(s) - n i L-'(5)Q;(s, L, F)ds. 


The integral is often referred to as the aberration integral, and the inte- 
grand L-!Q; as the driving term. The complete solution then is obtained 
as 


> 


F(s) = L(s)r, + 27 y (S) = L)ri + XC L(s)- A L-(8)Q;(s, L, Re) ds. 


So, once the linear solution is known, everything else just boils down to 
quadratures. If within a piece in which it is constant, M (s) is diagonalizable, 
the linear solutions can be written as combinations of sin, cos, sinh, cosh and 
s. In other important cases where M(s) is singular, often a complete set of 
linear solutions that are polynomials in s can be obtained. 

In both of these cases, the insertion into the polynomials R;(s) leads to 
terms that are polynomials in sin, cos, sinh, cosh and s. By expressing such 
functions in terms of exponentials times powers of s, one can show that the 
result of any integration can again be expressed as a polynomial of sin, cos, 
sinh, cosh and s. 

For practical cases, it is worthwhile to discuss the complexity of the proce- 
dure. With each new order, the expansion of the ODE becomes more compli- 
cated; then all previous orders have to be inserted, multiplied with the linear 
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inverses, and integrated, resulting in substantially more terms than for the 
previous order. Altogether, the effort increases extremely dramatically 
with the order being considered, and for typical systems, it is practical only 
to orders around five. 

Computer codes that use the above procedure usually contain a library of 
procedures that compute the aberrations for each particle optical element of 
interest. The aberrations of combined systems is then determined from those 
of the pieces with the help of a composition procedure. The Differential 
Algebraic (DA) approach, as described in Chapter 5, allows the computation 
of aberrations to any order in an elegant way without the need of explicit 
formulas for aberrations. 

To illustrate the method of computation of aberrations with a simple ex- 
ample, let us consider the differential equations 


xe’ =a, a! = —z + kz?, 


which corresponds to the horizontal motion in a quadrupole with a superim- 
posed sextupole. We first perform the linearization to obtain 


(2) = C1 9) 9C) 
'The linear solution then is 
1) - CO la Jas)" (Ces coss (a) 2). as 


The next step is the expansion of the ODE, which is already done in the 
given differential equations. We then insert the solution expanded up to the 
second order in the initial conditions x; and a; 


z(s) — (x|z)z; + (z|a)a; + (z|zz)z? + (z|za)z;a; + (z|aa)a?, 


a(s) = (a|z)z; + (a|a)a; + (a|zz)z? + (a|xa)z;a; + (a|aa)a?, 


into the ODE, and obtain 


(|a) xi + (|a) a4 + (|a) x? + (z|xa) z;a; + (z|aa) a2 
= (alx)x; + (a|a)a; + (a|zz)z? + (a|xa)z;a; + (a|aa)a2, 
and 


(ajz) xi + (a|a)'a; + (a|zx) x? + (a|za)' zia; + (a|aa)'a2 
= — [(a|x)a; + (cla)a; + (z|xz)z? + (z|xa)v;a; + (|aa)a; ] 


+k [(x|x)?x? + 2(x|z)(x|a)z;a; + (z|a)?a2 ns -] l 


where we can ignore the higher order terms, since we are interested only in 
order two. 
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'The second order equations then read 


(z|zx) z2 -- (z|xa)'z;a;-(z|aa)'a?—  (alax)x? -- (a|xa)o;a; 4- (a|aa)a2, 
(a|za) z? -- (a|za) z;a;-- (a|aa) a2 = —[(a|xx)x? + (a|xa)a,a;+(x|aa)a?] 


+ k[(z|z)?z? -- 2(z|a) (z|a)z;a; - (:|a)2a2], 


where the last line proportional to k is the inhomogeneous part Qə, and using 
eq. (4.39), 


G, = E 
Qoa k [cos? s - £? + 2cosssin s+ xia; + sin? s- a2] J ` 


We make the ansatz Ro(s) = L(s)T(s) : 


(z|zz)z? + (z|za)z;a; + (z|aa)a2 coss sins F(s) 
= SJ. 
(a|zz)z? + (a|xa)z;a; + (alaa)a? —sins coss 


2o. (Tr\ [l^s-im 4 [ (coss —sins 0 A 
Paus t =| i Gas - | Ge cmd) PEL 
SO 
nal (— sin 5 - Q2a) ds 
0 
si (— cos? 8sin 8: x? — 2 cos 5sin? 3 - aja; — sin? 3 - a?) ds 
0 
2 1 2 
= k | =(cos® s — 1)a? — £ sin? s - gja; + | coss — = cos? s — = | a? | , 
3 3 3 
za] (cos? 3- £? + 2 cos? Ssin 3 - aja; + cos 8sin? §- a7) ds 
0 


1 2 1 
=k (sins - jus) z2— 3 (cos? s — 1) aja; + T MEI f 


Then we obtain Rə (s) = Ê (s)-T (s) , which yields the second order elements 
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of the transfer map 


(a|aa) — k 


PS 
Q 
8 
8 
~~" 
Il 
2v 
VEI SRE NM ONT ONE ES FS 


1.4 1 1 

go ZI 
ee, Df 

-Z sin scoss-+ sins), 


cos? s — e ! 
3 3J- 


1 
sin s cos s + gue 3 


WIM VIN wle 


sin? s — Le eit 
3 3 : 


as Nn 
3 Sin 5 cos s 3 sins]. 
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In a similar fashion, but with much more effort, one can also compute the 


third and higher order terms. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


Chapter 5 


Computation and Properties of 
Maps 


Up to now, the equations of motion have only been solved perturbatively to 
the first order. Yet the knowledge of the nonlinear part of the solution is 
also needed to determine precisely the performance of a device. "Traditionally, 
this is done analytically using perturbation theory (see Section 4.5 for more 
details). In the past, a tremendous amount of knowledge about aberrations 
for various kinds of devices has been accumulated. Yet this approach is far 
from being systematic, making the simulation prone to errors, and it is diffi- 
cult to obtain an accurate solution for a realistic device where no analytical 
solution exists. In this chapter, a modern method, the Differential Algebraic 
(DA) technique, for computing the transfer map to arbitrary order, will be 
described. But before we embark on this task, we will first classify aberrations 
that can appear in transfer maps in terms of their symmetries. 


5.1 Aberrations and Symmetries 


Recall that the transfer map of an optical system relates final coordinates 
to initial coordinates via 


27 = M(%i), 


where 7 = (x, a, y,b,1,6). In the previous chapters, we were concerned mostly 
with the linearized part of the map, which describes the major part of the 
motion and which can be described by transfer matrices. The matrix elements 
were denoted as (a, a), etc. 


In order to study the effects of the motion very precisely, it is necessary 
to also consider higher order or nonlinear effects. For this purpose we Taylor 
expand the map (in a rigorous sense the question whether the map can actually 
be Taylor expanded is rather nontrivial, but we ignore this here), and use 
names for the coefficients similar to what we had for the linear motion. We 
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write 


z a7 ate yv bY [1 6*5) ate afe yty te 15" , 


Bp Xl ) a" 

ap = X (ajg ayi biligt) weary fep, 
yr = X (yle ate yb 1" 5%) ar belis, 
bp = X (blafsafsa pi ptg) teate yit 5%, 
pec (Le ayo piae aia yiv pP qua. 
p = X (gea sq pO aay beigt, 


where the sums go over all six-tuples (iz, ia, iy, ib, i1, i5); for convenience, they 
are usually sorted by total order. The Taylor coefficients belonging to terms of 
orders 2 or higher are usually called aberrations, as they describe corrections 
to the linear part of the map that are usually small if the phase space variables 
are small. 


In most cases, the freedom of the aberration coefficients is severely restricted 
by the presence of a variety of symmetries. First, in many cases the motion 
of one of the variables does not depend on the values of some other variables. 
For example, if the motion is time independent, we have 


(Z;|a'*afey'vbi*[555) — 0 if 4 40, 


where j = 1,...,6 and Z; is defined in eq. (2.2). Furthermore, in this case 
we know that the kinetic plus potential energy of the particle is conserved, 
and we have that 


(ó|a** ale y*vb/^ [^ 655) 2 0 except (0|ó) = 1. 


5.1.1 Horizontal Midplane Symmetry 


'This is perhaps the most important symmetry in beam physics, as it affects 
almost all devices: bending elements, quadrupoles, sextupoles, higher order 
multipoles, cyclotrons and all the combinations of them. It requires that the 
motion of charged particles is always symmetric around the midplane (the z-z 
plane), which is illustrated in Fig. 5.1. 


In a system with midplane symmetry, two particles that are symmetric 
about the midplane at the beginning stay symmetric throughout the system. 
Suppose that a particle is launched at (xj, Yi, di, ai, bi, ti). After the map M 
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FIGURE 5.1: ‘Trajectories of particles in a system with horizontal mid- 
plane symmetry. 


is applied, its coordinates are 


Lp = ma(zi, Gi, yi, bi, li, ôi), 
af = ma(zi, ai, yi, bi, li, ôi), 
yf = My (Ti, ai, yi, bi, li, Ôi), 
bf = M (Ti, ai, Yi, bi, li, 0i), 
lg = mi(zi, ai, Yi, bi, li, ĝi), 
Of = Ma (Ti, Qi, yi, bi, li, ôi). 


Under the presence of midplane symmetry, a particle that starts at 
(Ti, —yi; di, ai, —bi, ti) 
must end at 
(tp, —yf df ap, —bf,tf), 


which entails that 


i» li, di), 


Tf = m (zi, ai, —Ui; —b 
af = ma(zi, ai, —yi, —bi, li, Ôi), 
( bi, li, ôi), 

bi, li, ôi) 

lf = mi(zi, ai, —yi, —bi, li, ĝi), 
) 


TUF = My\ Ti, Qi, — Yi, — 


is 


—bf = M (Ti, ai, —Yi, — i) 


of = ms(zi, ai, — Yi, —bi, li, Ôi (5.1) 


Thus flipping the signs of y;, b; simultaneously flips the signs of yf, by, but 
leaves xy, ay, lf, Of intact. Flipping the sign of yi, b; simultaneously produces 
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a sign of (—1)'"*^ in each monomial. So i, + ij must be odd for yr, by and 


i, + iy must be even for all others. So we obtain 


(x ait qia yiv piv ix gis 
(a ait qia yn piv ix gis 
(y ait qia yiv piv ix gis 
(b aie qi yo pi gs 
(1 aie aia yiv pie gio 
(6 aie qia yin piv [a gis 


for ty +i, odd, 
for ty +i, odd, 
for iy +i» even, 
for iy +i» even, 
for iy-- ij odd, 
for iy +i odd. 


(rjr) (ma) 0 0 (al) (ad) 
(aj) (ala) 0 0 (all) (alo) 


(jz) (ao 0 0 (H) Qld) 
(ôx) (la) 0 0 (All) (93) 


which is a form seen earlier for electrostatic and magnetic bending elements. 
Altogether, the symmetry entails that to any given order, roughly half of all 
aberrations vanish. 


5.1.2. Double Midplane Symmetry 


Several devices have a midplane symmetry not only around the horizontal 
plane, but also around a vertical plane. This is the case for all electric cylin- 
drically symmetric devices, as well as quadrupoles, octupoles, and in general 
4k poles. In this case, in addition to the requirements we just had, we obtain 
a second set in which the roles of x, a and y, b are interchanged. In this case 
we obtain 


(|...) 2 0 for iy+% odd or i,--i, even, 
(a|...) — 0 for i,+% odd or ig+iq even, 
(yl...) 20 for i,+% even or i-i, odd, 
(b|...)— 0 for i,+% even or i-i, odd, 
(l|...) 2-0 for i,+% odd or ig+iq even, 
(ô|...) 2 0. for ty+% odd or i,-4 i, even, 
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and altogether, about three-fourths of all matrix elements vanish. To first 
order, the matrix must have the special form 


(x|r) (ala) 0 0 0 0 

(air) (ala) 0 0 0 0 

yw- 9 9 (ww (lb) 0o 0 
0 (oy) (bb 0 0 p’ 


0 
0 0 0 0 (QJ) ql) 
0 0 0 (8D (55) 


which is what we observed in the case of the drift and the electric and magnetic 
quadrupoles. 


5.1.3 Rotational Symmetry 


One special case of the double midplane symmetry that we just discussed 
is the full rotational symmetry that round lenses satisfy. In this case there is 
a symmetry going beyond what double midplane symmetry requires; the map 
has to be invariant under a rotation in the z-y plane. Let the rotation angle 


be ¢. The linear transformation R(Z) = R- Z is described in terms of the 
matrix 


cos 0 sng 0 0 0 

0 cos@ 0 sing 0 0 

-|7 sing 0 cosp 0 0 0 

E 0 -—sinó 0 cosd 0 0 A 
0 0 0 0 1 0 
0 0 0 0 0 1 
and we must have that the transfer map satisfies 
MoR=RoM. (5.2) 


In the variables we are currently using, the study of the influence of the 
rotation on the map is somewhat cumbersome, and for this purpose it is 
actually better to choose complex coordinates 


z=xz+iy, w=a +ib, 
as well as their complex conjugates 
Z—mr—iy, W=a-ib. 


In these complex variables, the map R has the simple diagonal form 


é o 0o 0 0 0 

0 & 0 0 0 0 

0o 0 e 0 0 0 
xs 0o 0 0 e2 up o’ 

0 0 0 0 1 O0 

0 0 0 0 0 X 
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and its effect in eq. (5.2) is easy to study. It turns out that in the map only 
those terms that have the form 


zp = zi: fz(2Z,ww), wr = wi: fwlzz, ww) 


are allowed to remain. In passing we note that this situation is remarkably 
similar to what happens in the theory of normal forms of repetitive motion 
[5]. 

There are two types of rotational symmetry: One is characterized by the 
system being invariant under a rotation of any angle, which we call con- 
tinuous rotational symmetry; the other is characterized by the system being 
invariant under a fixed angle, which we refer to as discrete rotational sym- 
metry. The former is widely seen in light optics where almost all glass lenses 
are rotational invariant and in electron microscopes where solenoids are the 
primary focusing elements. The latter is preserved in quadrupoles and all 
higher multipoles. 

For the analysis of both cases we proceed with the above complex coordi- 
nates. After expressing x, a, y and b in terms of z, Z, w and t», the transfer 
map is transformed into 


zp \ up t+ iyf - F, re ee a 
E 2 Gane B c3 (zi; Zi, Wi, Wi, ti, di), 


( E z ) - y & ) ii pou qp) p qia. 
C Pao ratos. dc 
W / jdadadadtja 


where 


Note that besides z and w, also z and w will appear, contrary to the familiar 
Taylor expansion of analytic functions. This is due to the fact that while the 
original map may be Taylor expandable and hence analytic as a real function, 
it is not necessary that the resulting complex function is analytic in the sense 
of complex analysis. 

Given the fact that a rotation by $ transforms z to e'?z and w to e'?w, 
rotational symmetry requires that a rotation in initial coordinates results in 
the same transformation in the final coordinates, i.e., zf — ei? zy, wr > 
e'?wr. Inserting this yields 


(jı — j2 + j3 — ja — 1) 6=2an for w,a,y,b terms, 


(j1 — j2 + j3 — ja) 6 = 2mm for t,d terms, 


where n is an integer. 
For continuous rotational symmetry, which means invariance for all ¢, the 
ji (i = 1,2,3,4) should be independent of à. Thus we have 
ji—Jj2cja—j4-—1 for zgandwy, 
ji—Jj2cja—j4-0 for tganddy, 
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TABLE 5.1: 


Number of aberrations 


Order Ji j2 ja jA 
1 1 0 0 Q0 

0 0 1 0 

3 2 1 0 0 

2 0 0 1 

0 1 2 0 

0 0 2 1 

1 1 1 0 

1 0 1 1 


which eliminate many terms. First, all terms with jı + j2 + j3 +j4 even vanish, 
because jı + j3 and jo + j4 always have the same parity, which means that 
ji — j2 ja — ja is also even. This implies that in rotationally symmetric 
systems, all even order geometric aberrations disappear. As a summary, all 
remaining z and w terms up to order 3 are shown in Table 5.1, where the 
order represents the sum of the j;. To illustrate the characteristic of such a 
map, let us derive the linear matrix from the conditions above. First define 


(z|z) = (€2)100000; (z|w) = (€z)oo1000; 


(w|z) = (Cw )100000; (w|w) = (Cw )001000- 
The first order map is then given by 


vy + typ = (z|z)(xi + iy) + (2|w) (ai + ibi), 
ag + tbe = (w]z) (ai + iyi) + (w|w)(ai + ibi), 


which entails that the linear matrix is 


Rele) —B(zlz) Rew) -Iel 

| Belz) Rea SGho) Who) 

M — | (uz) —S(wlz) R(wlw) —S (wh) m 
S(w|z) O(w|z) S(w|w) S3t(w|w) 


As an example, we show the second order map of a solenoid, which has rota- 
tional symmetry, but exhibits a coupling between x and y, and a and b. 

'Table 5.2 lists the coefficients of the second order map of a solenoid, which 
shows that indeed all second order geometric aberrations vanish, which is a 
consequence of the rotational symmetry. 

Eq. (5.3) also shows that a rotationally invariant system preserves midplane 
symmetry to first order when the first order coefficients are real numbers. In 
fact a simple argument shows that this is true even for higher orders. In 
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TABLE 5.2: 


initial variables x, a, y, b, 0, 6) 
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The second order map of a solenoid (Exponents in the 


Xr af Yf b f l f exponents 
0.999662 -0.408336e-3 -0.186647e-1 0.761884e-5 0 100000 
0.799815 0.999662 -0.149334e-1 -0.186647e-1 0 010000 
0.186647e-1 -0.761884e-5 0.999662  -0.408336e-3 0 001000 
0.149334e-1 0.186647e-1 0.799815 0.999662 0 000100 
0 0 0 0 1.000000 000010 
0 0 0 0 0.163101 000001 
0 0 0 0 -0.112007e-3 200000 
0 0 0 0 -0.876499e-9 110000 
0 0 0 0 -0.219388 ^ 020000 
0 0 0 0 -0.102394e-1 011000 
0 0 0 0 -0.112007e-3 002000 
0 0 0 0 0.102394e-1 100100 
0 0 0 0 -0.876499e-9 001100 
0.370284e-3 0.223860e-3 0.102326e-1 -0.836226e-5 0 100001 
-0.438474 0.370284e-3 0.163792e-1 0.102326e-1 0 010001 
-0.102326e-1 0.836226e-5 0.370284e-3 0.223860e-3 0 001001 
0 0 0 0 -0.219388 ^ 000200 
-0.163792e-1 -0.102326e-1 -0.438474 0.370284e-3 0 000101 
0 0 0 0 -0.134185 000002 
complex coordinates, eqs. (5.1) are transformed to 
xd t = (z) (zi, Zi, Wi, Wi, ti, di) 
= 5 ( az ) zi zi2qy3 piatt di^, 
Jajadajadeja ` "l dijajajajeja 


which shows that all coefficients have to be real numbers in order to preserve 
midplane symmetry. 

For discrete rotational symmetry, invariance occurs only when $ = 2r/k, 
where k is an integer. Hence the nonzero terms satisfy 


ji — j2 + j3 — ja — 1 = nk. 


In general, a 2k-pole is invariant under rotation of = 2r /k. For example, 
for a quadrupole, we have k = 2. Hence the nonzero terms satisfy 


jı — ja + ja — ja = 2n 4- 1. 


Like round lenses, systems with quadrupole symmetry are also free of even 
order geometric aberrations. The linear map of a quadrupole can be obtained 
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from 


fi — j2 + j3 — j4 = t1, 
which is 
zs +iys = (z|z)(mi + iyi) + (z]w)(ai + ibi) + (2|2)(xi—iy:) + (z|W) (ai — ibi), 
ap + ibg = (w|z)(x; + iyi) + (w|w)(a; + ibi) + (w|z)(zi— iyi) + (w|w)(a; —ib;). 


Since a quadrupole has midplane symmetry, all the coefficients are real num- 
bers. Thus its linear matrix is 


(z|z) + (2|z) 0 (z|w) + (zu) 0 
0 (z|z) — (|z) 0 (z|w) — (|i) 
(w|z) + (wlz) 0 (w|w) + (w|w) 0 
0 (w|z) — (wlz) 0 (w|w) — (w|w) 
For other multipoles, we have 


—k+1 
jı — j2 + j3 — j4 = nk + 1 = 1 k =3,4,.... (5.4) 
k+1 


With midplane symmetry, the linear matrix of a 2k-pole is 


(zz) (zw) 0 0 
j (wiz) (ww) 0 0 


iim 0^ 0 Gk) (el) SS 


Since the linear matrix of a 2k-pole (k < 3) is just a drift, it satisfies eq. (5.5). 
Eq. (5.4) shows that the geometric aberrations appear only for orders of at 
least k — 1. The fact that multipoles do not have dispersion determines that 
the chromatic aberrations do not appear until order k. This can be easily seen 
from the equations of motion (3.22). Therefore, a 2k-pole is necessarily a drift 
up to order k — 2. 

A lens with rotational symmetry is frequently called a round lens. The 
main examples are magnetic solenoids and electrostatic round lenses. Since 
rotational symmetry is the highest degree of symmetry a lens can have, round 
lenses have the fewest number of aberrations. As a result, they are widely 
used in low energy electron optical devices such as electron microscopes. At 
high energy, round lenses are too weak to be effective. 


5.1.4 Symplectic Symmetry 


Another important symmetry of the motion is due to the fact that the mo- 
tion is indeed obtained as the solution of Hamiltonian differential equations. 
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In this case, one can show that the Jacobian M of the transfer map M, i.e., 
the matrix 

ƏMı/ðzı O.Mi1/O0zo O.Mi/Oza OM 1/024 OM 1/025 OM 1/dz% 
ƏMəa/ðzı 0.M2/0z2 0.M»5/0za 0.M»/0z4 OM» /0zs O.M 5 /0zg 
OM3/021 OMs3/0z2 0.M3/0za 0.M3/0z; OM a /Ozs 0.M 3 /0zg 
0M4/8zy OM4/0z2 O.M4/Oza OMA4/Oz4 OM4/0z5 OM4/Oz6 |’ 
OMs/021 OMs /Oz2 OMs /0z3 OMs / 02: OMs /Ozs OMs /0z6 
OMg6/021 O0M6/022 O0Me6/023 OM6/0z4 OMe6/0z25 OMa/Ozg 


has to satisfy the condition TM . 
MT jM = J, (5.6) 


where J is the totally antisymmetric matrix 


cooooonxu 
T de d. esie re 
coooeooo 
ÍíGDooooo 
ol a a a oo) 


The proof of this so-called condition of symplecticity certainly goes beyond 
this volume and can be found, for example, in [5]. But we can readily see 
that the symplectic condition, which mixes in a very defined way the terms 
OM,;,/0z; that are themselves power series, entails a large variety of nonlinear 
restrictions between the aberrations. 

From eq. (5.6), we have 


—-jMTjM =I > -MJMTJM = M => - MJMT j — I, 
hence "m ; 
MJMT = J. 
Furthermore, the inverse of a symplectic matrix always exists and can be 
obtained easily with the help of eq. (5.6), as 


jut Mt = - P5 (UTS) a Eo M7 = rJ. 
M-1=-JMT J. (5.7) 
And from the following simple arithmetic 


(1) Jar = (97) 5 (373) = 5333073 
= -JMIM" j= - 555 = J 


ri 
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it is shown that M-! is symplectic as well: 
a T aok ^ 
(ur) jr = J. 


Now let us obtain the determinant of M following Kauderer (page 10 in 
[36]). Through a series of permutations, we can rewrite J, which becomes 


TIN 
H2S( a 


where Î is the n x n identity matrix. Furthermore, M can be written as 


"WT 
ü-($$) 


where A, B, C and D are n x n matrices. The symplectic condition becomes 


ár ON ( 6 PVNÍABN (Of 
BP BP) \=i 6) oD) Aer 6) 
which leads to the relations 
zT Â + ÂTÔ = Ô, —ÓT B + AT D = f, 
=P ALB Casi, DT B + BT D — 6. (5.8) 


Furthermore we need one more mathematical theorem, which is 


Ad ENN m 
det ^ ^ = A. D = t(AD). 
e (5 5) det A - det det( AD) 


Hence we have 


Using this, we obtain 


À B AB Í —A3B 
i (5 5)-«(5 5) «(à j ) 


zu zr. am zd A ry CY A-1B 
= det (2 jos) det Adet(D — CA B). 


Furthermore, using the relations det A = det ÂT, CA-! = (AT)-1CT and 
AT D — CT B = Í that have resulted from eq. (5.8), we obtain 


det (4 : ) = det A” det (D — (A7)-1 67 B) = det (47b ÔTÊ) = 1. 
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One direct consequence of the symplectic condition and the resulting unity of 
the Jacobian determinant is that the volume of the phase space is conserved 
under Hamiltonian motion, which is known as Liouville’s theorem. 


The detailed study of the relationships between matrix elements is cumber- 
some and can be found in [79]. Here we will restrict our attention to what 
happens in the linear case. Considering the constant part of the symplectic 
condition (5.6), we observe that what contributes via the Jacobian is just 
the transfer map. Let us assume no acceleration and coupling between the 
transverse planes, which entails that the transfer matrix is given by 


(sz) (ela) 0 0 0 (ad) 
xd Md (uh) "T 6 an ae 
P YY y y — A NE 
M=| o o (ly) bo (js) |5|} Y Pp 
(Iz) Qo) (ly) U) 1 (6) IS NES 
0 0 0 0 0 1 
where 
» (Gl) Gl) V o _ (Gly) GID) p 1 QI) 
m - (Gs RUE = (09 BUE B= ( 0 1 ) 


Plugging into the symplectic condition yields the equation 


T ô Ee J 0 60 XO De J 00 
GF 228 98: ope s e (5.9) 
br br at} 60 JA NA, E, E 00j 
The left hand side is 
oan e —— LJ, XTjb, + EIR 
TT; TL PTPP oT P TTE 
Ly JL tL JL, Y'JD,-L,JE 


After straightforward algebraic manipulation, we have for each term in the 
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left hand side 


Noc ei 0 (lala) — (rla) (ala) 
|o am [xen eens 0 ) 
CHEM 0 (ulu) (lb) — (bly) (ule) 
dct erg | - [wl 05) — Oly) i] 0 ) 
CN EU 0 1 
Di JD, + Di IDy+ ESE = (5 a 


PTD, PTjB — ( 0 (lly) + Gul) (045) — (bly a 
EE. (1b) + (y|b) (b) — (blb) (ylâ) 
So, the conditions in eq. (5.9) are described as 

(x|x)(ala) — (x|a)(a|x) = 1, 

(yly)(b|b) — (b|y)(ulb) — 1, 

(ijz) + (x|z)(a]8) — (alz)(x]ó) = 0, 

(lja) + (x|a)(a]ó) — (aja)(z|8) = 0, 

(ly) + (y|y)(b]ó) — (b|u)(uló) = 0, 

(ib) + (y|b)(b]) — (bjb)(y]ó) = 0. (5.10) 


The first two of these are familiar and describe the fact that the volume of 
phase space is preserved under the linear transformations generated by particle 
optical elements. The other conditions, however, represent the connection 
between longitudinal and dispersive effects. For them to be satisfied requires 
the use of specific scaling factors for the variables | and 6, and they are the 
reason for the specific choice of the variable & in eq. (2.1) in Section 2.1. 
Next let us study the case that coupling between horizontal and vertical 
planes is present, where the Jacobian matrix M isa4x4 symplectic matrix. 
Similarly, we can divide M into four blocks of 2 x 2 matrices, which is 


R AB 
Mee Ome oy 
C D 
Plugging into eq. (5.6), we obtain 


+ôr jô d F 4 
B o> , 


TAA Tj BTjB4 BTID 
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which can be simplified as 
detA+detC =1, detB+detD=1, ATIB+C™ID =O. 


In general A; B ; C and D are not symplectic matrices. Meanwhile, from eq. 
(5.7), we have 


dex. Ro EVIE Qv... MP OUT 
© Kô ÊJ ET pr S) BL qb 
AC 


where A is defined as 


=-JATS, 


and B, C and D are defined the same way. In addition, we have 


2 0 1 Q11 021 0: 1\_ 422 —012 | (a fy. 4-1 
2m e i 2 d i) (S ma) = (aet â) m 


Since M-! is symplectic, we have 


det À--det B — 1, detÓ -- det b —1, AIC 4 BTJD — 0, 


ATjA+BTIB ATIC + BTID 
CTIA+DTIB CTIC+ DID 


Il 
VEU 
c» es 
e c 


and hence 


where the relation det X — det X, (X = A,B,C, D) is used. As a result, we 
obtain a set of important relations 


det A + det B = 1, det D = det A, det Ó = det B. 


It is worth noting that there are only six independent constraints from the 
symplectic condition for a 4 x 4 matrix. 


5.2 Differential Algebras 


In this section we will provide an introduction to the theory of Differential 
Algebras (DA) which enables the computation of transfer maps to an arbitrary 
order. For reasons of brevity we only provide a limited overview [4]; a more 
complete treatment can be found for example in [5]. For the sake of clarity, 
we first address the simplest case of Differential Algebras, mathematically 
denoted as the structure ;D}. 
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5.2.1 The Structure ‚Dı 


Consider the vector space R? of ordered pairs (ag, a1), @o,a1 € R in which 
an addition and a scalar multiplication are defined in the usual way: 


(ao, a1) + (bo, b1) = (ao + bo, a1 + b1), (5.12) 
t- (ao, a1) = (t 3 ao, t 7 a1), 


for ao, a1, 60,61 € R. Besides the above addition and scalar multiplication, a 
multiplication between vectors is introduced in the following way: 


(ao, a1) * (bo, b1) = (ao - bo, ao : 61 + a1 - bo), (5.13) 


for ag, a1, 09,01 € R. With this definition of a vector multiplication the set of 
ordered pairs becomes an algebra, denoted by ı D1. 

In the same way as in the case of complex numbers, one can identify (ao, 0) 
as the real number ag. Where in the complex numbers, (0,1) was a root of 
—1, here it has another interesting property: 


(0, 1) t (0, 1) E (0, 0), 


which follows directly from eq. (5.13). So (0,1) is a root of 0. Such a property 
suggests thinking of d — (0, 1) as something infinitely small, small enough that 
its square vanishes. Because of this we call d — (0,1) the differential unit. 
The first component of the pair (ao, a1) is called the real part, and the second 
component is called the differential part. Using this notation it becomes clear 
that elements of ı Dı can be written as ao + a1: d, and multiplication amounts 
to multiplying the polynomials (ao + a1: d) and (bo + bı - d) and keeping only 
terms linear in d. 

It is easy to verify that (1,0) is a neutral element of multiplication, because 
according to eq. (5.13) 


(1,0) $ (ao, a1) = (ao, a1) : (1,0) = (ao, @1). 


It turns out that (aọ,a1) has a multiplicative inverse if and only if ag is 
nonzero. In case ao Æ 0 the inverse is 


1 a 
(aon) = (2-3). (514) 
0 


Using equations it is easy to check that in fact (ao, a1)! - (ao, a1) = (1,0). 
An outstanding result of the methods of differential algebras is that dif- 
ferentiation becomes an algebraic problem, and the differential part of the 
difference 
f(x t d) — f(x) 
equals the conventional derivative. Thus, given any differentiable function f, 
we can compute its derivatives by just evaluating the formula and thus obtain 


f'(z) =D [f(x +d) — f(z)] =D (f(x + d), (5.15) 
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where D denotes the differential part. In the last step use has been made of 
the fact that f(x) has no differential part. Hence Differential Algebras are 
useful to compute derivatives directly, without requiring an analytic formula 
for the derivative and without the inaccuracies of numerical techniques. 

The computation of derivatives shall be illustrated in an example using the 


following function: 
1 


= ———— i .1 
1a) = =; (5.16) 
The derivative of the function is 
1/z22—1 
(Os 
(x + 1/z) 


Suppose we are interested in the value of the function and its derivative at 
x = 2. We obtain 


Now take the definition of the function f in eq. (5.16) and evaluate it at 
2+ d= (2, 1). One obtains: 


1 1 DE 
f{2,)] = @h+1/@1 (Q5,040/2,-1/4) — (5/2,3/4) 


= (1/ 6/2), — (8/4) / (6/2) re (735) l 


As we can see, after the evaluation of the function the real part of the result 
is just the value of the function at x = 2, whereas the differential part is the 
derivative of the function at x = 2. 

By our choice of the starting vector (2,1), initially the vector contains the 
value J (2) of the identity function J : x — x in the first component and the 
derivative of I’(2) = 1 in the second component. 

Now assume that in an intermediate step two vectors of value and deriva- 
tive (g(2), g/(2)) and (h(2), h’(2)) have to be added. According to (5.12) one 
obtains (g(2) +h(2), g' (2) 4- / (2)). But according to the rule for the differenti- 
ation of sums, this is just the value and derivative of the sum function (g + h) 
at x = 2. 

The same holds for the multiplication: Suppose that two vectors of value 
and derivatives (g(2), g'(2)) and (h(2), h/(2)) have to be multiplied. Then 
according to (5.13) one obtains (g(2) - h(2),g(2) - h’(2) + g'(2) - h(2)). But 
according to the product rule, this is just the value and derivative of the 
product function (g-h) at z = 2. 

The evaluation of the function f at (2, 1) can now be viewed as successively 
combining two intermediate functions g and h, starting with the identity func- 
tion and finally arriving at f. At each intermediate step the derivative of the 
intermediate function is automatically obtained as the differential part ac- 
cording to the above reasoning. 
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An interesting side aspect is that with the search for a multiplicative inverse 
in eq. (5.14) one has derived a rule to differentiate the function f(x) = 1/x 
without explicitly using calculus rules. 

After discussing the algebra ,D, and its virtues for the computation of 
derivatives, we now address the most general Differential Algebra, the struc- 
ture ,D,, which corresponds to the case of power series of v variables to 
the nth order. It will eventually allow us to arithmetically compute partial 
derivatives of functions of v variables through order n. 


5.2.2 The Structure ,D, 


We define N(n,v) to be the number of monomials in v variables through 
order n. We will show that 


where C(i, 7) is the familiar binomial coefficient. First note that the number 
of monomials with exact order n equals N (n, v — 1). This is true because each 
monomial of exact order n can be written as a monomial with one variable 
less times the last variable to such a power that the total power equals n. 
Thus we have 

N(n,v) = N(n—1,v) - N(n,v — 1): 


the number of monomials in v variables through order n equals the number 
of monomials of one order less plus the ones of exact order n. This recursive 
relation is satisfied by C(n + v, v). Since also obviously C(1 + 1,1) = 2 = 
N(1,1), the formula follows by induction. 

Now assume that all these N monomials are arranged in a certain manner 
order-by-order. For each monomial M we call Iņ the position of M according 
to the ordering. Conversely, with Mr we denote the Ith monomial of the 
ordering. Finally, for an I with M; = a) ---aí» we define Fy = ii! iil. 

We now define an addition, a scalar multiplication and a vector multiplica- 
tion on RY in the following way: 


(a1,..., GN) + (b3,..., bn) = (a1 bi, ,an o oy), 
t- (a1,...,aN) = (t: ay, ,t-an), 
(a1,...,an)- (by, ..., b) = (er... CN), (5.17) 


where the coefficients c; are defined as follows: 


v b 
a=F Y T 7 (5.18) 
O€XwuxN Y P 
M,-M„=M; 


To help clarify these definitions, let us look at the case of two variables 
and second order. In this case, we have n = 2 and v = 2. There are N = 
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C(2 + 2,2) = 6 monomials in two variables, namely 
1, £, y, v, vy, yy. (5.19) 


As an example, using the ordering in (5.19), we have Isy = 5 and M3 = y. 
Using the ordering in (5.19), we obtain for cı through cg in eq. (5.18): 


cy = 01:4, 

c2 = a1 + b2 + a3: b1, 

c3 = a1 : b3 +a3°bi, 

c4 = 2- (a1: b4/2 + a2 09 + a4: b1/2), 
C5 = 01:05 + a2: b3 + a3: b2 + as - by, 
ce = 2 - (a1 - be /2 + ag - b3 + ag - b1/2). 


On nD, we introduce a third operation ð; : 


O0,(a1,..., aN) = (e3,..., CN) 
with 
| 0 if Mj has order n, 
= Alma) else. 


So ð, moves the derivatives around in the vector. Suppose a vector con- 
tains the derivatives of the function f, then applying 0, to it one obtains the 
derivatives of Of /Ox, through one order less. With this third operation, , D, 
becomes a so-called Differential Algebra (DA)[5]. 

While in 1.D4, d = (0,1) was an infinitely small quantity, here we have a 
whole variety of infinitely small quantities that have the property that high 
enough powers of them vanish. We give special names to the ones in com- 
ponents J belonging to first order monomials, denoting them by d Mr. In the 
example of 2D2, we have dx = (0,1,0,0,0,0) and dy = (0,0,1,0,0,0). It then 
follows that instead of eq. (5.15) we obtain 


a 0f Of Pf Pf f 
f (x + da, y + dy) = (s ðr’ dy’ Ox2’ Oxdy’ Oy? (x,y). 


In the general case of v variables and order n, after evaluating f in DA one 
obtains: 
giitizt t f 


S CI S Y > 
Oxy} Oxy +++ Oxy (zi cu) 


v? 


where L (it ni?) is the index of the monomial q} --- a’, as defined in the 
1 v 


beginning of the section. 
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5.2.3 Functions on Differential Algebras 


In this subsection we will generalize standard functions like exponentials, 
logarithmic and trigonometric functions to Differential Algebras. As we will 
see below, virtually all functions existing on a computer can be generalized in 
a straightforward way. 

We start our discussion by noting that for any Differential Algebra (DA) 
vector of the form (0,a1,..., aw) € nDy, i.e., with a zero in the component 
belonging to the zeroth order monomial, we have the following property: 


(0,a1,...,aN)* = (0,0,...,0) fori>n, (5.20) 


which follows directly from the definition of the multiplication in ,D, defined 
in eq. (5.17). 

Let us begin our discussion of special functions with the exponential func- 
tion exp(x). Assume we have to compute the exponential of a DA vector 
that has already been created by previous operations. First we note that the 
functional equation 

exp(z + y) = exp(z) - exp(y) 
also holds for DA vectors. As we will see, this facilitates the computation of 
the exponential considerably. 


exp[(ao, a1, a2, ..., aw)] = exp(ao) - exp[(0, a1,a2, ..., aw)] 


oo 


= exp(ao) i 


01,02, ..., QN). i 01,02, ..., QN). 

(0, a1, à san) enai: Y (0, a1, a QN) 
i=0 i=0 

In the last step use has been made of eq. (5.20) which entails that the 
sum has to be taken only through order n and thus allows the computation 
of the root in finitely many steps. Hence the evaluation of the real number 
exponential exp(ao), which internally on a computer requires a power series 
summation and hence cannot be done accurately, is more subtle than the rest 
of the operations in Differential Algebra. 

A logarithm of a DA vector exists if and only if ag > 0. In this case one 
obtains 


log [(ao, a1, @2,.--,an)] = iog fao E (o. 2. 28... EJ 


ago ao ao 
oo EN i 
= (log(ao),0,...,0) + 3 (1)! - (o. SL L3 
remi 1 ago ao ao 


= (log(ao),0,...,0) + XC (-1)*! - (0. BL E A L3 


Again use has been made of the fundamental property of the logarithm 


log(z - y) = log(z) + log(y) 
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which also holds for DA numbers and leads to simplifications by virtue of eq. 
(5.20). 

As the last example, we will derive a formula for the root function. Even 
though there is a direct method to compute roots by solving a set of linear 
equations for the coefficients of the root, we present here a technique based on 
power series following an approach similar to the exponential and logarithm. 
The root has the following power series expansion: 


Vi*z- ee = = = Lat 


Using this formula and the definitions of addition and multiplication in 
(5.17), one directly obtains for the square root of a DA vector: 


Using the addition theorems for sine and cosine, one obtains formulas with 
finite sums in a quite similar way; in general, suppose a function f has an 
addition theorem of the form 


f(a + b) = ga(b), 


and ga(b) can be written in a power series, then by the same reasoning its 
Differential Algebraic extension is computable exactly in only finitely many 
steps. 

In the meantime, there are numerous codes built on the ideas of Differential 
Algebraic methods, including the code COSY INFINITY [7, 50]. 


5.3 The Computation of Transfer Maps 
5.3.1 An Illustrative Example 


Differential Algebras (DA) can be used very efficiently to compute the trans- 
fer map of eq. (2.3) of particle optical systems in its Taylor series representa- 
tion. 

To illustrate this, let us start the discussion with a very simple example, 
the midplane motion in a 90° homogeneous bending magnet. Let x; and 
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FIGURE 5.2: Motion in a 90? homogeneous dipole magnet. The dashed 
arc is the reference orbit. 


à; — sin(o;) denote the initial distance and scaled transverse momentum 
relative to the reference trajectory (see Fig. 5.2). Then we are interested 
in the values z; and a; = sin(af). Since the trajectories in the magnet are 
circles, we can readily read from Fig. 5.2: 


A= Rsin(o;) = Rai, 


B = RQ - cono) += R (1- VA - at) + xi, 


: B 
a — sin(aj) = -F 


ny = A- R(1—cos(as)) = A- R(13- A - ad). 


These equations allow the computation of the final coordinates z; and a; 
in terms of the initial coordinates x; and a;. However, taking these equations 
and performing all operations in Differential Algebra allows us to even obtain 
all derivatives of xf and a; with respect to x; and aj. These so obtained 
derivatives, evaluated at x; = 0, a; = 0, are then the expansion coefficients 
of the map in eq. (2.3). For the sake of clarity, let us explicitly show how af 
and af are computed. 


Using the ordering in (5.19) and identifying the variable a with y, we obtain 
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using the arithmetic defined in eqs. (5.17), 


a; = (0,1,0,0,0,0), a; = (0,0,1,0,0,0), 
A = (0,0, R,0, 0,0), B = (0,1,0,0,0, R), 
1 1 
—[0,——,0,0,0,—1 .2[0,0, 8, —— 21 
af ( , R’ 3M MS ) Tf ( SM3 , 5^0) (5 ) 


Comparing the obtained result with any matrix code, we find complete 
agreement; as an example, the fact that the second component of x; is zero 
implies that Ox ¢/Ox; = 0 and hence (r|v) = 0, which is a well known property 
of 90? bends. 

In case an additional particle optical element is to follow this bending 
magnet, one does not have to start all over evaluating this new element at 
x; = (0,1,0,0,0,0), a; = (0,0,1,0,0,0), but one can start already with c; 
and ay of eq. (5.21). This way one can save the usually quite involved con- 
catenation process and increase performance significantly. 


5.3.2 Generation of Maps Using Numerical Integration 


In this subsection we will address the general case in which no closed solu- 
tion of the problem exists. We will see that also in this case we are actually 
able to compute transfer maps of arbitrary order for arbitrary particle optical 
elements. Even though we do not have analytical formulas that relate the final 
coordinates to the initial coordinates, there is still a way to computationally 
relate the final coordinates to the initial coordinates, by numerical integration 
of the equations of motion. 

In this case, the final coordinates are still computed from the initial coor- 
dinates using standard arithmetic and functions; however, the relations are 
more complex than in the case of the homogeneous sector. As in any conven- 
tional numerical method, a numerical integrator is used to solve for the final 
coordinates. 

Now blindfoldedly performing all these operations in Differential Algebra 
automatically leads to all desired derivatives of the transfer function, regard- 
less of the form of the equations of motion. In other words, all coordinates 
and fields at any step are power series instead of real numbers. 

Differential Algebraic (DA) techniques have been implemented in many 
programs. They allow the computation of transfer maps of elements with a 
dependence on the independent variable for which an analytic solution cannot 
be obtained. One example is magnets with fringing fields. Another example 
is an electrostatic round lens where the electric field varies with s throughout 
the entire lens. As long as the electromagnetic field can be expressed in a 
differentiable function, the transfer map up to any given order can be obtained 
using the DA technique. 
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5.4 Manipulation of Maps 


In most cases, a beam optical system consists of more than one element 
and it is necessary to connect the maps of individual pieces. Often the inverse 
of a map is needed. Sometimes one part of our system is the reversion of 
another part; therefore, it is time saving if the map of the reversed part can 
be obtained directly from that of the other part. All these map manipulations 
can be done elegantly using DA techniques. 


5.4.1 Composition of Maps 


Whenever a system contains more than one element, which is virtually 
always true, we have to deal with the composition of maps. This is the 
foundation of almost all other map manipulations, as we will see later. 

Let us define o as the symbol for composition. Hence the map M of a 
system consisting of two parts is 


M = M2 0 Mı, (5.22) 


where Mı and Mg are the maps of parts 1 and 2, respectively. According to 
Taylor's theorem, Mə can be expressed as the sum of a Taylor series and a 
remainder: 


M» = Tri TRA, 


where Rn is of order n 4- 1. With the assumption that Mı is origin preserving, 
we have 


[M], E [Mo o Mi], = (Tn TRA) o Mi] x [Tn o Mi] + [Rn o Mi] 
= [Tn 0 Mijn = Ta (Mila). 


Thus [M], can be obtained by composing two polynomials, which is called 
concatenation. 

Concatenation is the most frequently used tool in DA calculations. It is 
extremely efficient when a given optical element appears multiple times in 
a system. Instead of computing the map every time it appears, the map of 
the element can be applied to the system using concatenation after the first 
appearance. 

When the exact formula of Mə is known and it is relatively simple, eq. 
(5.22) can be used directly to compute M, spending only a small fraction 
of the time required for concatenation. The saving comes from the fact that 
Mo is not expanded into a Taylor series. In fact, this method has been used 
whenever a element is a drift or a homogeneous dipole because their maps can 
be obtained from simple geometry. 

Another application of the DA concatenator is the transformation of coor- 
dinates among different codes for the study of the dynamics of beams. For 
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instance, while many codes work in the above discussed curvilinear canonical 
coordinates, other codes use the slope instead of the normalized momentum. 
In certain cases, Cartesian coordinates are used, which may be a better choice 
for certain elements, for example, wigglers, but are usually not very well suited 
for a discussion of the properties of beamlines. The transformation of a trans- 
fer map to a different set of coordinates, which can be expressed as 


Mr Co Mg oC, 


is quite straightforward using Differential Algebra. One simply composes 
the transformation formulas between the two sets of coordinates, the map in 
one set of coordinates and the inverse transformation in Differential Algebra. 
Thus, one automatically obtains the map of the other set of coordinates to 
arbitrary orders. 


5.4.2 Inversion of Maps 


In this subsection, another kind of manipulation, the inversion, will be 
studied. The first step of inverting a Taylor map is to invert the linear part. 
If it exists, it can be done with any standard linear algebraic package. For 
the nonlinear part the inversion is done in an order-by-order fashion. To the 
second order we can write the map and its inverse as 


M = Mı +M, MC = Mi + Mg. 


Note that the subscript denotes the order of the map. Concatenating those 
two, we obtain 


MoM! = (Mı +Mə2)o (MI HM3 +) 23 I+ Mao.M, I M oM! = T, 


where the term M2 o M3 ! is dropped due to the fact that it contains terms 
of the fourth order. As a result, the second part of the inverse is 


My! 23 -Mī ' o0 M20 M11. 


Now let us assume that we already inverted the map up to the (n — 1)th order; 
we write the map and its inverse up to the nth order as 


M =n Mı +Mhn, M^ =n Mz) MS. 
Hence, we have 
M o MC! =n (M3 + Mn) © (Mi! TM) 
=, Ld Mao MI + Mio M E MoM; =n T. 


Since Mn is of the second order and higher, only those terms of the order 
[n/2] or lower in M}! contribute. The result is 


My) =n —Milo Mno (MI! Mj), 
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where m = [n/2] and m > 1. It is clear that the inverse of a Taylor map 
exists as long as the inverse of its linear part exists. In later chapters, we will 
encounter a number of problems that have to be solved through finding the 
inverse of a map. 


5.4.3 Reversion of Maps 


Throughout the development of beam optical systems, mirror symmetry has 
been frequently used. Recently, mirror symmetric systems are being studied 
in great detail in the search for high order achromats. Among the various 
symmetry arrangements, reversion is the most commonly used. For a system 
which is the reversion of another one, the transfer map is the map obtained by 
going through the system in the reverse direction. The reversed motion can 
be described by first reversing time, i.e., switching all the signs of p, and py, 
then going through the inverse map, and finally re-reversing time. The time 
reversal operation can be performed easily using Differential Algebra, and the 
inversion of the transfer map is done as described in Section 5.4.2. 

Specifically, reversion entails that, if a particle enters and exits the forward 
system at an initial point (zi, a;, yi, bi, li, Ôi) and a final point (xy, ay, yr, by, ly, 
df), respectively, it will exit the reversed system at (£i, —ai, yi, —b;, —li, ĝi) 
after entering this system at (ry, —af, yr, —bf, —lf, Of). This determines the 
reversion transformation: 


1 0 0 0 0 d) 
0—1 0 0 0 0 

7 0.0 1 0 0 0 

R=| ò o Ser esp 0 0 
0o 0 0 0 -1 0 
o 0 0 0 0 1 


and hence the map of a reversed system: 
ME = (R) o M^ o (R71). 

In DA representation, MP is computed through concatenation, where 
ME = (R) o M5! 0(R7!). 


In fact, the second composition (R) o M,,! can be done by simply changing 
the signs of the rows for a, b and l. 

An interesting point worth noting is that R is not symplectic. In fact, it 
satisfies the following relation 


RTjR = —J, 


which we call an anti-symplectic relation. 
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Chapter 6 


Linear Phase Space Motion 


In this chapter, we want to study the action of transfer matrices on particles 
by looking in detail to what happens to entire regions of phase space as 
they are transported. This is important because the beam in an accelerator 
is just such a region, and of course we want to make sure that at any time, 
this region is within the beam pipe. 

Let us begin by collecting several observations about two-dimensional trans- 
fer maps M. 


1. M preserves areas. 

2. Different initial points have different final points. 
3. Continuous curves stay continuous curves. 

4. Closed curves stay closed curves. 


5. A point inside a closed curve will stay inside of the closed curve. 


Fig. 6.1 illustrates the second item, and Fig. 6.2 illustrates the other items. 
The first item led us to giving a name, namely “emittance,” to the preserved 
area. The last two items are particularly important, as they tell us that if we 
can enclose our beam within any closed boundary curve, then it is sufficient to 
study the dynamics of this boundary curve alone. It is interesting to note that 
while in the two-dimensional case, closed curves always stay closed curves, it 
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FIGURE 6.1: Mapping of individual points in phase space. 
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FIGURE 6.2: Mapping of a closed curve in phase space. 


is not generally true that in higher dimensions, closed surfaces stay closed 
surfaces. While this is true for linear higher-dimensional transformation, non- 
linear maps can produce some holes in the surfaces through which particles 
that were initially trapped inside the surface may find a way to escape, which 
is a very important mechanism that can lead to instability. 

If in particular M is linear, then we also have the following observations. 


1. Straight lines stay straight lines. 


2. Ellipses stay ellipses. 


Since straight lines stay straight lines, we may manufacture such a boundary 
curve as a polygon, and to study its motion it is completely sufficient to move 
only the corner points. Alternatively, we may try to enclose the beam by an 
ellipse. Before we follow these ideas, let us first study the action in phase 
space of some simple devices. 


6.1 Phase Space Action 
6.1.1 Drifts and Lenses 


As seen in Section 2.2, the transfer matrix of a drift is given by 


#=(3i) 


This matrix leaves a constant and moves x by an amount proportional to a; 
hence it performs a horizontal shearing in phase space as shown in Fig. 
6.3. 
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FIGURE 6.3: Action of a drift in phase space. 


A lens has the transfer matrix 


r= Capi) 


and it leaves z invariant and changes a by a value proportional to x; hence it 
performs a vertical shearing as shown in Fig. 6.4. 


6.1.2 Quadrupoles and Dipoles 


In the case of quadrupoles and dipoles as seen in Sections 4.2 and 4.3, the 
matrices have the following form: 


^ cos @ ksin $ 
Me E c cos @ ) : 


This corresponds roughly to a rotation, except that the x and a coordinates 
are also stretched or compressed; the result is a motion on an ellipse as shown 
in Fig. 6.5. In fact, computing the invariant ellipse of the motion following 
the procedure described in Section 8.1.2, we obtain 


Applying to eq. (6.1), we see from a; = 0 that the ellipse is even upright. 
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FIGURE 6.4: Action of a thin lens in phase space. 


FIGURE 6.5: Action of a quadrupole or dipole in phase space. 


6.2 Polygon—like Phase Space 


In order to study the motion of ensembles of particles under linear transfor- 
mations, it is useful to characterize them by certain simple geometric forms 
in which the particles are contained and requiring only few parameters. The 
two most useful such forms are the polygon and the ellipse. 

A polygon in phase space is uniquely defined by its corner points; and since 
straight lines stay straight lines, it is sufficient to study just the motion of 
the corner points. Fig. 6.6 shows the motion of a polygon under a linear 
transformation. 

Frequently a polygon with just four points is chosen; if its lines are initially 
symmetrically arranged around the origin, they will stay symmetrically ar- 
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FIGURE 6.6: Mapping of a polygon in phase space. 


ranged. Furthermore, a four-point polygon with symmetry around the origin 
is a parallelogram, and so parallelograms always stay parallelograms. 

In many cases it is worthwhile to study how the actual beam width changes 
as a function of the s-position along the beamline. The beam width is appar- 
ently determined by the maximum of the horizontal positions of the corner 
points. In the special case in which we consider motion through a drift, each 
of the corner points moves on a straight line. Furthermore, the corner point 
that is furthest out will stay furthest out until it is possibly overtaken by 
another corner point; during the time it determines the beam width, it entails 
that the beam width changes linearly with s. Since the outermost corner point 
can change from time to time, the resulting beam width is piecewise linear 
as a function of s. 


6.3 Elliptic Phase Space 


The other choice that is worth considering is that of an elliptic phase space 
as shown in Fig. 6.7. In this case, the boundary of the phase space satisfies 
the ellipse condition 


yz? + 2oa + Ba? =e, (6.1) 


which can be written in matrix form using a symmetric matrix as 


(x, a) - a 3) (2) =g 


For future simplicity, we denote the matrix describing the ellipse by 6. 
We first note that there is a redundancy in the description of the ellipse: 
obviously, doubling the values of a, 8, y as well as € simultaneously leads to 
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FIGURE 6.7: An ellipse in phase space. 


the same ellipse. In order to eliminate this redundancy, we demand that the 
determinant of the ellipse be unity, i.e., 


By — 07 21. 


With this choice of the matrix, the quantity £ is a unique measure of its area, 
called the emittance. The four quantities a, 3, y and e are called the Twiss 
parameters of the matrix. 

Now we are ready to study the question how the phase space ellipse changes 
as we pass through a system. Let M be the transfer matrix of the system; 
then the coordinates z,, a; are transformed to x2, a» via 


(2) (a) 
CE 


The new ellipse after the system characterized by M must obviously satisfy 


(x2, a2) < 62 - (2) =e. (6.2) 


and we also have 


ag 


Observe that if we demand det(G2) = 1, even the measure for the occupied 
area e must be the same as before since we know the transfer map preserves 
area. We recall that the old coordinates satisfy 


S BH 
(21,01): 61- 5 =E. 
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FIGURE 6.8: Characteristic points of an ellipse in phase space. 


Expressing xı, a, in terms of x2, a2, which is accomplished by the inverse 
matrix, we obtain 


(£2, a2) - (0) -a i ar) & =e. (6.3) 


It is not difficult to show that (M-1)7 -64- M7! is a symmetric matrix, thus 
we conclude that the resulting object is again an ellipse. So ellipses are indeed 
preserved under linear transformation. Furthermore, since the determinant of 
the matrix is unity (see Section 5.1.4), such a representation of an ellipse by a 
symmetric matrix of unity determinant is unique, and because eqs. (6.2) and 
(6.3) hold at the same time, we must conclude that 


ô = (w= 61 MI. (6.4) 


This equation describes the transformation of the ellipse in phase space 
under the linear transformation. 


6.3.1 The Practical Meaning of o, 8 and y 


As we propagate the beam through a system, the value of 6 changes with 
s, and so do its three characteristic quantities a, 8 and y. It is important to 
study how the three quantities a, 8 and y describe important characteristics 
of the beam. Another important question relates to the shape and degree of 
deformation of the ellipse. Together with the widths, this is characterized by 
the points at which the ellipse intersects the axes as shown in Fig. 6.8. 


148 An Introduction to Beam Physics 


160. 
140. - 
120. - 
100. 


B m) 
b 
D 


060 3 10 I5. 20 25. 30 35 40. 
s (m) 


FIGURE 6.9: Sketch of horizontal (solid) and vertical (dashed) 8 functions 
of a beamline. 


The question of axes intersection can be answered readily. In 
2 ppm 
ya" -Fr2oxa-4 fa*-e, 


we just set a and x to zero, and obtain 


z "B a -J5 
9. — y 0 — B 


Now we address the calculation of the maximal points £m and am, which 
characterize the width as well as the maximum angle in the ellipse. To this 
end, we view the elliptic shape as the contour line of a function, and remem- 
ber that the gradient is always perpendicular to the contour lines. Hence 
the maximum position occurs where the angular component of the gradient 
vanishes, and the maximum angle occurs where the positional component of 


the gradient disappears. For the function 
f (z, a) = yz? + 20a + Ba’, 
we have 
V f = (2yx + 2aa, 2ax + 20a), 


and we infer that for the maximum position, we must have az = — fa, namely 
a = —a/B - x. Inserting this into the ellipse yields 


2 
qz? + 2o (-5«) +6 (-53«) =e => (By—o?)a?- ef, 
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thus 

tm = vef. 
Because of the symmetry of the equations with respect to interchange of x 
and a, we see that also 

am = Vey. 


So the maximal width in x direction is determined by the area of phase 
space e as well as the function f. Thus, 8 plays an eminent role, as it im- 
mediately tells the width of a beam at a given point; and plots of its value 
for different positions around the accelerator are very commonly studied. An 
example of such a plot showing the functions for horizontal (x) and vertical 
(y) motion of a beamline at the Advanced Light Source (ALS) at Lawrence 
Berkeley National Laboratory (LBNL, LBL), California, USA, is shown in 
Fig. 6.9 [54, 21]. 


6.3.2 The Algebraic Relations among the Twiss Parameters 


In this section, we attempt to introduce concept of the phase advance, 
which is the difference in phase between two points on the s-axis, and obtain 
the relations among the Twiss parameters. Let M(s) be the transfer matrix 
of a beamline, which may or may not be part of a periodic transport system, 
from sı = 0 to s2 = s. Let a, 8 and y be the Twiss parameters at s; = 0 and 
a(s), B(s) and g(s) be the ones at s2 = s. From eq. (6.4), we obtain that 


(26) 13) = (Ceo) (2 §) Ge) 
or in another form 
Que (28) LOR CT ^ e 


On the other hand, it is straightforward to show that 


VB -a/VB\ (ya VB 0 - 
0 1/V/B o BJ V—o/VB 1/VB 


which means that the matrix 


de SES ud 
—-a[ VB 1/V/B 
transforms the ellipse into a circle. The new coordinates are sometimes called 
the normal coordinates. This equation entails that 


v e| _ (1/vB a/vB) (1vB 0 Be 
a8] \ 0 VB }\a/vB vB)’ | 
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Plugging eq. (6.6) into eq. (6.5), we obtain 


(ie) (O rt 1//B(s) 0 P 


0 Bs) (3/ VBR) VBC) 
_ (34B alvBY (vB 0 
o vB) \a/vB vB} 


Hence, we have the following relation 


P Pd TOR ka em) 
0 1/VB 0 B(s) 
(ausis vam) “O (e uen] = 
/ vV B(s) 
Defining 
se - [isis vss) "C 
/v/B(s) v/B(s) 


we immediately have 


NE 
VB 
-o/ VB p we 


or equivalently 


which entails that 


the latter holds due to the fact that det(R(s)) = 1. As a result, we have 


pope 


which leads to the relations 


Rii(s) = R22(s), Ri2(s) = — Rai(s). 
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Plugging them into the equation 
Rii(s)R22(s) — Fas(s)Hizi(s) = 1, 


we obtain 
Ri, (s) + Rio(s) = 1. 


Therefore, the matrix elements can be expressed as trigonometric functions 
of a single variable ¢(s) and the matrix R(s) takes the form 


oí. _ [f cosd(s) sin¢g(s) 
m S sin d(s) cos 2) l (6:8) 


Since R(0) = I, 4(0) = 0. Plugging eq. (6.8) into eq. (6.7), we obtain the 
explicit form of the transfer matrix, which is 


re B(s) 0 = Aue 5 3 

(s)/ v B(s) 1/4/B( a/VB VB 
p(s) 0 cosó(s) sing(s)\ ( 1/VB 0 
(s)/A/B(s) 1//B(s) — sin ġ(s) cosQ(s) a/VB VB 


(6.9) 
- í B.[B (cos $s +asin bs) VP. B sin ds do 
ma (s) vV/B/ Bs (cos ġs— as sin bs) 


where 
1 


ae 


and as, Gs and $, represent a(s), G(s) and $(s), respectively. Denoting 


[((a,—a@) cos ps +(1+aas) sin ds| 


where each element is a function of s, it is easy to show that 


tangs = 5 (v|a) 


(z|x) — a (v|a) 


when the values of (x|z) and (z|a) from eq. (6.10) are plugged in. 

From the coordinate transformation point of view, eq. (6.9) illustrates the 
relation between the physical coordinates and the normal coordinates. The 
right most matrix transforms the physical coordinates into normal coordinates 
where the motion in the normalized phase space is a rotation, represented by 
the middle matrix. At end of the transport, the normalized coordinates are 
transformed back to the physical coordinates. 
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FIGURE 6.10: Transformation of an ellipse under a drift. 


For many practical applications, it is useful to explicitly study the trans- 
formation of the ellipse (6.4) through the influence of the matrix M. We first 


observe that if ( | | 
A" te Aa 


= (ty e) 


as simple arithmetic shows. So we have 
T 
> M=) [mo IU —_ [2 a2 
s ( & By az Bo 


T ( (ala) — e, D. 2 ( (ala) — Sey l 
—(zla) (ax) oi fi —(alx) — (u|v) 
Performing the calculations, we see first of all that ao, £5 and y2 depend 


linearly on o4, £1 and J1, and hence the relationship can be written in 
matrix form. Explicitly, we have 


then 


Bo (|x)? —2 (2|x) (|a) (cla)? Br 
a» | = | — (ælļz) (ale) (2x) (ala) + (xa) (ala) — (ala) (ala) | | oa 
t (a|z)* -3 (a|z) (ala) (a]a)" m 


One particularly interesting case is the one where we let an ellipse evolve 
under the action of a drift, as shown in Fig. 6.10. 

If we are interested in the way in which the width of the beam changes, we 
must look at the function 8(s). For the special case of the drift matrix with 
(z|x) = (ala) = 1, (a|z) = 0 and (x|a) = L, we have 


B(L) = (e|x)” By — 2 (ae) (zla) os + (zla) 1 = &i — 2L + L’ 


[o2] 2 a? o1 : 1 
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FIGURE 6.11: Plot of the 8 function in a drift. 


So as a function of L, 6(L) changes quadratically. We also see readily that at 
the point where 
Ic 
V1 
the beam has minimum width, and we have what is called a waist (see Fig. 
6.11). On the other hand, we obtain from eq. (6.11) that 


y(L) =, 


which reflects the fact that the divergence of the beam is not changed in a 
drift. As a result, we have 


1 
B(L) = T(E)’ 


which entails that 
a(L) =0 


at the waist. Meanwhile the behavior of a(L), which is 


(6.13) 


where (* is the value at the waist and s is the longitudinal distance from the 
waist. 
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6.3.3 The Differential Relations among the Twiss 
Parameters 


Now let us consider the case that s is small and to the first order 6(s) =1 
B+ B's, a(s) =. a+ a's and (s) —1 $ s. Similarly, the transfer matrix 
becomes 


* 1+ (8/28 od) s BÀ s 
M(s) =1 ‘ : : ; , (6.14) 
= (1/8) e + (1 +02) ¢'|s 1- (8'/28 +a¢') s 


where each element Mij, the i, j-th element, is computed as 


Mii(s) =1 B -— [cos (es) + asin (o's)| =,1+ (5 2j 8, 


Mis(5) =1 y (8 + 8's) Bim (¢'s) =1 Bos. 
Ma(s) 21— TESIS [a's cos (0's) + (1 +a (a + a's) ) sin (o's)| 


=; 3 |o + (1 + o?) o] S, 


=1 1— (5 tas) S. 


On the other hand, the transfer matrix to the first order of s can be solved 
directly from the equations of motion, which is 


M(s) =1 e i) , (6.15) 


where k is the focusing strength at sı = 0. Equating the corresponding terms 
in eqs. (6.14) and (6.15) yields 


, B 2 
Bo =1, $5*o$-0, -gle t0+a)g]= -k 

which can be simplified to the familiar form 
, 1 
$ = c5 
B 

o —kB- y, 

B = —2a, 


y = 2ka. 
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The behavior of ¢, a, 8 and y are determined by the above set of differential 
equations. 


6.4 *Edwards-Teng Parametrization 


Here we have shown an alternative way of propagating the Twiss parameters 
through a beamline or a ring. The advantage is that only matrix multiplication 
and inner products of vectors (rows of a matrix) are used. In the following 
we will show that this way of tracking the Twiss parameters can be readily 
extended to coupled z and y motions. 


The approach is based on the Edwards-leng parametrization of a four- 
dimensional symplectic matrix. From the relation 


XL = M àN / ico D-sngV[(ÀAÓ Ícosp —D-!sing 
^ Um NJ. \-Dsiny feos 0B Dsing Icosy i 
we obtain 


M = Acos? o + D^! ÊD sin? o, 
N = B cog? pt DAD~ sin? 9, 
m= (Då = BD) sin o cos o, 
f --— (åD — b^!) sin Y cos y, (6.16) 
where Á, B and D are symplectic. It is straightforward to obtain 
A=M- D- tan y, B- N + Datany. 


Subtracting the second equation from the first equation of (6.16) and taking 
its trace, we have 


tr(M — Ñ) = tr (A = B) cos? ip + (DBD = DAD~*) sin? e| 
Ż (trA -tr B) cos? ip + [tr (~ED) -tr (bÀb-!)] sin? o 
= (tr À —tr B) (cos? p — sin? y) 
= 2 (cos — cos u2) cos (29). (6.17) 


In the last step, the assumption that |tr Â| < 2 and |tr É| < 2 is adopted 
following [25], where the matrix M4 is the one turn matrix of a ring. For a 
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generic symplectic matrix, all the conclusions hold after replacing cos yı — 
cos u2 with (tr A — tr B)/2. As a result, we have 


tr(M — Ñ) 


en 
eese) 2 (cos 3 — cos 12) 


From the fourth equation of (6.16), on the other hand, we have 
= — (AD! — D^! B simecose = (AJDTJ — JD JB) sim ecos v, 
and hence 
Jat JT =J (FDAT = BT j'bj) JIE sin o cos o 

=— (bà = 5b) sin o cos q. 

Adding the third equation of (6.16) on both sides, we obtain 
m+ Jat IT — — [b (â+ A) - (B+ B^) D] simecose 

— [Dex A) = (tr B)D| sin y cos Y 

= — (cos u1 — cos uia) sin (2y) D. (6.18) 
Finally, we obtain 


mt IAT jT 


| E a, 
(cos u1 — cos u2) sin (24) 


Now, the only unknown quantity is cos 4, — cos us, which can be obtained 
below. From eq. (6.18), we have 


det (à + JEU) = (cos py — cos ua)? sin? (29) . 


Adding the square of eq. (6.17), we obtain 


di. ias en det(in + JàT JT 
cos fl1 — cos p2 = 5 tr(M — N) MUS 
tr(M — N)/2 


Now let us consider the case that M4 is symmetric, i.e., MT = M, ÑT =Ñ 
and fT = rh. As a result, we have 


m+ IMIT 


ie m VIO et 
(cos u — cos u2) sin (299) ' 
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and so 
Jim JT + $m IP) IT 
(cos [41 — cos u2) sin (2) 
ERN E ET; 
(cos u1 — cos u2) sin (24) 


D- = JDT JT = 


Let us define 


Since det Ô = 1, we have 


pos d —b 
—c aj. 
The relation DT — D-! entails that d — a and c = —b. Combining with 


det D = 1, we obtain a? +b? = 1, which means that D is a rotation. Further- 
more, we have 


dida (Jr? JT. + MTM (det r)f + inm 
nN = ————————————— —— rrr 
(cos Jj — cos u2) sin (24) (cos u1 — cos u2) sin (2y)’ 


and 


bs (rh + Jin JT) mT + (det in) 
T) mA ———————————————— EM —————————————Ó4 
(cos u3 — cos u2) sin (2) (cos [41 — cos u2) sin (2) 


Therefore both D- rh and Df are symmetric and, as a result, A and Ê are 


symmetric as well. 


6.4.1 The Algebraic Relations with Coupling 


Let us start by noting that the 4 x 4 matrix that describes the beam ellipse 
of the coupled transverse motion is symmetric and symplectic, which is a 
result of the fact that the motion of the particles is symplectic. From the 
argument above, we have 


xe M a\ / Ícop D-sneWX(ÀA Ô Ícosp —D-!sing 
^ m Nj \-Dsing feosy ô B/\Dsing [cose S 


where det A = det B = det D = 1. Fora symmetric matrix characterizing the 
four-dimensional beam ellipsoid, the parametrization can be written as 


$- Ícosp D-'singV( 1, 0 Ícosp —D-!sing 
» V—Dsino [cos ô T, Dsing Icos i 


158 An Introduction to Beam Physics 


where 77 = Ty, iy = T. and DT = D-!. Using the relation DT = D-t, we 
obtain immediately 


: n T 
Icosp D !singY | T cosy DT sing T cos —D-!sino 
—Dsiny Icos ~\-Dsiny Icosy Dsing Icosy 


As a result, we have 


TALS x 
$- T cosy - D- l sing T, 0 Î cosg - D- l sing 
Dsing  Ícosp ô fT,/VXDsino  fcosp ? 


which means that the coordinate change that block diagonalizes T in the 
linear form also block diagonalizes T' in the bilinear form. The matrices T; 
and T; can be decomposed the same way as eq. (6.6) and we obtain 


^ "S ^ ^ T ^ ^ PS 
i 0 rt I coso dal Icosp D- es 
a fee A ac n 
0 A /\-Dsinyg lcoso —Dsiny Icosy 0 


TE A 
” —Ax,y/V/ Pa, 1/A/ Buy 


Similar to eq. (6.9), this equation shows that the matrix 


n Ícosp D-!'sinp A, 0 
A4 — ^. ^ X ^ 
—Dsing Icosy 0 A, 


transforms the four-dimensional ellipse into two decoupled circles and the 
transfer matrix can be written as 


TRE Í cos Q2 D; sin y2 Ai 0 R, 0 
E - D, sin Y2 Í cos Q2 0 Ay 0 R, 


A: 0 Ícosqi - D7! sin y 
0 Án Di sin q Î cos Q1 


^ cos sin 
Rey = ( Qa. y e . 


where 


—sindz,y COS Qr, y 
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Similar to the two-dimensional case, we have 


Î cos y2 D; sin y2 Ee 0 
e: Î cos qa )( 0 A 
-u( Í cos pı o & 0 Nee ô ) 
—Dısiny Icosyy 0 yi 0 Pn 


A 
= Ma Ícosqi Dising A 0 Êz! 0 
E m N -Ô sin q Í cos pı 0 Ay 0 B 


M Âs COSQ1 —AD Art sing, MD;'Ay singı RÀ COSQ1 f 
i ThÁsi COSQ1 —ND, Avi sing, MDT Ay sino +Ñ Â; COSQ1 
Defining 


E MEETS P Mii T"12 
M Axı COS gı = nD Agi sin Qi Z ES pr ; 
M21 M22 


we obtain that 
V Bx 0 Sud fidi 1/12 COS x2 — sin $42 
22|-.  ~ . 
—az2/VBr2 1/V Bao Thoi M22 singz2 COS x2 
p. COS $45 + M12 Sin Q3. — 143 SİN Qz2 + M12 COS Qz2 


(6.20) 


M21 COS Q3 + M22 Sin r2 — 131 Sin Qz2 + M22 COS Qz2 


Similar to the two-dimensional case, we obtain the relations 


T1112 ae TU 
tan @z2 = Am. COS Q5 = yV M11M22 — m131031, 
11 


mi, + Mi Mə1Mıı + M22M12 

2.2 = =— UL eee 
Tn111T0122 — M12M21 ™117M22 — ™127M21 

The relations in the vertical plane are the same. Note that both planes give 

the same tilt angle y2 due to the fact that 


det (An COS Y1 — AD Agi sin e) — det (ibi Â, sin Q1 + Ñ Âj COS 1) g 


The four-dimensional case reduces to the two-dimensional case when p2 = 0. 
At this point, the advantage of this procedure based on coordinate transfor- 
mation is rather clear since it avoids the coding of complicated formulas which 
are prone to errors. Instead, it divides the task into a few simple and standard 
steps which are finding the normal coordinates of the initial ellipsoid, tracking 
the transformation matrix to the point of interest and finding the Twiss pa- 
rameters at the point of interest. With the help of the Differential Algebraic 
(DA) technique, it is straightforward to include parameter dependence of the 
Twiss parameters, which can introduce beating due to momentum deviation 
or quadrupole errors. 
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Chapter 7 


Imaging Devices 


In the following chapters, we will discuss what specifically has to be done to 
the map of a system to make the system useful for a specific task. In many 
cases, this requires that certain matrix elements vanish, or sometimes assume 
specific values. The most important device is probably the imaging device, 
in which final positions are independent of initial angles, as shown 
schematically in Fig. 7.1. Represented in the language of the transfer map, 
this entails that 
(zla) 2 0,  (y|b) = 0. 


It is apparent the final angles a; and by are unimportant since it does not 
matter at what angle the rays strike at the image position; so all terms of the 
form (a|...) or (b|...) are insignificant. Additional requirements usually exist 
for the various subclasses of imaging systems. 

There are many types of devices that form images of charged particles. The 
following are a few types that are widely used. 


7.1 The Cathode Ray Tube (CRT) 


The cathode ray tubes (CRT) are a class of imaging that have seen wide 
use in electronic displays, such as the television tube and the oscilloscope. 
As far as practical use, impact on society, and revenues are concerned, the TV 


FIGURE 7.1: Sketch of an imaging system. 
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tube was until recently the most important application of particle optics. In 
this case, for each color an electron beam is deflected vertically and horizon- 
tally by two simple magnetic deflectors in order to sweep over the screen area, 
and the intensity of each beam is adjusted according to the color saturation 
at the respective point. The cathode ray tubes used in the oscilloscopes use 
electrostatic deflection plates to achieve high frequency. Yet the limit of a sin- 
gle pair of plates is around 150 MHz, above which the single pair is replaced 
with segmented pairs of plates. This type of deflector can reach a frequency of 
300 MHz. To reach even higher frequency, a double helix line is used to deflect 
the beam, which has reached 10 to 20 GHz. Nowadays, 33 GHz oscilloscopes 
are commercially available. 

At any given point on the screen, the resulting spot should not be wider 
than the distance between two pixels, so whatever size the beam had initially 
should not be amplified very much; so 


(rjr) and (wl) 


should not be large. 

The requirements for aberrations are usually benign as the phase space 
volume of the beam is small. Yet those aberrations do limit performance and 
over the decades various ways have been developed to minimize them. The 
advent of plasma and liquid crystal displays (LCD) for TV on one hand and 
digital oscilloscopes on the other has caused a precipitous decline of the use 
of cathode ray tubes, which are preferred now only in specialized markets. 


7.2 The Camera and the Microscope 


The purpose of a camera and an electron microscope is to create an 
image of an object formed by light or particle rays. The quantities 


(rjr) and (yly) 


are magnifications, and in most cases it is desirable to have them equal. 
'The electron microscope is just a special case in which both of them are made 
to be very large to increase the resolution. 

If a true image is desired, it is important that the relationship between 
final and initial coordinates is really linear, which requires that all higher 
order position dependent matrix elements vanish, and so 


(alex) 20, (ylyy)=90, (mw|vzx) —0,.... 


In reality, of course, it is difficult to achieve this to higher orders, hence 
some distortion remains. In the case of an electron microscope, this is often 
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FIGURE 7.2: Curvature of the image. 


not detrimental as one can retroactively correct the effects by calculation and 
the resolution is not affected. The effects that appear usually have the con- 
sequence that rectangles are distorted into either the shape of a pincushion 
or into the shape of a barrel; these effects are due to 


(z|jzyy) and (yļyzz), 


which entail that rays that simultaneously have x and y coordinates are either 
pushed out from the center (pincushion) or pulled in (barrel). Higher order 
terms in x and y produce similar effects. 

There should be no effect of energy on position, so 


(z|ó) 2 0 and (y|d) =0 


should be maintained. Similarly, all higher order aberrations involving 6 
should vanish; if this is not the case, some color dependent blurring called 
chromatic aberration may occur, in particular for larger values of x and y, 
an effect that can be easily observed in the case of less expensive binoculars. 

'There should also be no effects of position on initial angles to higher order; 
so it is necessary that 


(z]a«t/^) = (yla'sb®) = 0, 


and since the range of accepted angles corresponding to a and b is often 
rather large, to correct these terms is often very important. If any of them 
prevail, they will entail a color independent fuzziness; in case the order of the 
coordinates a and b is even, the fuzz will be oriented toward one side like the 
coma of a comet; if the powers are odd, it will lead to a uniformly distributed 
fuzziness. 
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Similarly, all aberrations involving positions and angles simultaneously have 
to vanish, and hence it is necessary to have 


(|a** y'va'eb”) = (yla**y'va'eb") = 0; 


if any of them prevail, they will entail a position dependent fuzziness that 
becomes stronger with an increase of the positions x and y. 

Interestingly enough, all higher order aberrations depending on a and b only 
linearly can be corrected by a reshaping of the focal plane; in fact, (x|xa), 
etc., produce a tilt of the image, and (x|xzxa), etc., produce a curvature of the 
image. Fig. 7.2 shows how the matrix element (x|xxa) can be corrected by 
shaping the image position parabolically. 

At any given position, due to the matrix element (x|rra), any ray with a 
given a is moved up or down in proportion to a, where the amount of deflec- 
tion depends quadratically on x; so the rays arrive at the x plane as shown. 
However, tracing the rays backwards shows that they in fact all intersect be- 
fore the plane, and the point where this happens depends quadratically on zx. 
In similar ways, (z|v*a), etc., can be corrected. 


7.3 Spectrometers and Spectrographs 


Spectrometers and spectrographs are devices for the purpose of measur- 
ing momentum, energy, or mass of charged particles. Momentum spec- 
trometers are mainly used in nuclear physics for the determination of the 
momentum distribution of nuclear reaction products. Most of the momentum 
spectrometers known are magnetic because of the fact that the energies that 
need to be analyzed are too high to allow sufficient deflection by electric fields. 
In addition, magnets have two more advantages. They automatically preserve 
the momentum, and can be built big enough to achieve large acceptance. 

The mass spectrometer is mainly used for the analysis of masses of 
molecules, and they can be operated at much lower energies. They have 
a long history, and their applications pervade many disciplines from physics 
and chemistry to biology, environmental sciences, etc. Mass spectrometers are 
also more diverse; among the major types, there are sector field, quadrupole, 
accelerator, energy loss, time-of-flight, Fourier transform ion cyclotron reso- 
nance and ion trap mass spectrometers. 

For all the different types of spectrometers, the goal is to achieve high 
resolution, and in many cases large acceptance at the same time. As 
resolution improves, the need for better understanding and correction of high 
order aberrations increases. In the following, the linear theory of various types 
of spectrometers will be discussed, followed by the studies of aberrations and 
their correction. 
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FIGURE 7.3: The Browne-Buechner spectrograph. 


As alluded to before, in spectrometers the final position is used as a 
measure of momentum, energy, or mass of the particle. T'his requires that the 
final position be independent of other quantities, which in particular requires 
that the device be focusing such that 


(x|a) = 0. 


Due to Liouville’s theorem, it is impossible to obtain focusing and zero image 
size simultaneously, and hence the initial spot width has to be minimized to 
ensure that the final image is narrow enough. Furthermore, the dependence 
on the spectroscopic quantity of interest 6, the so-called dispersion 


(z|6), 


should be large. Finally, in a mass spectrometer where particles of different 
energies are present, the dependence on energy (r|ó) should vanish. In the 
map picture, the linear behavior is thus given by the transfer matrix of the 
horizontal motion 


A (rjv) 0 (ald) 
M = | (alz) (ala) (ald) | . (7.1) 
0 0 1 


Let 2D; be the width of the source. From eq. (7.1), it is clear that the par- 
ticles to be detected focus around a spot at (x|ô)ô with a width of |2(x|x).D;]. 
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FIGURE 7.4: Illustration of the imaging condition arising from Barber’s 
rule. 


Hence, the distance between the centers of particles of different energies must 
be larger than the width, i.e., 


|(x]6)6] > |2(a|x) Di]. 


This sets an upper bound for 1/6, which we call the linear resolving power (or 


linear resolution), 
1 
mcr shee, 
ô max 2(x|z)D; 


Hence in order to increase the resolution, it is necessary to increase |(z|ó)| 
and/or decrease |Dj|. 

As an example, let us study the first broad range momentum spectrometer, 
the Browne-Buechner spectrometer. It contains only a homogeneous dipole 
with 90? bending and circular pole boundaries. The layout is depicted in Fig. 
7.3, and it is applicable for particles with energies up to 25 MeV/u. As it 
turns out, there is a simple condition known as Barber's rule that assures 
that the system is r-focusing. This is in fact the case whenever the source 
location, the center of deflection of the magnet, and the image location lie on 
a straight line, as shown in Fig. 7.4. To prove Barber's rule, we first write 
down the transfer matrix of the horizontal plane, which is 


diras 1 ly cos 0 Rsin0 11 
ATIUM E —(1/R)sin@  cosð 01 


7 pm E iu [; 3 


— (1/ R) sin cos 0 0 1 
[0850 (Io/R)sind (lı +12) cos0 + (R — lıl2/R)sin 0 
= —(1/R) sin cos 0 — (I, /R) sin 8 ! 


where the angles 0, a; and o2 are shown in Fig. 7.4 and the quantities R, 
lı and l5 are the bending radius and the drifts before and after the dipole 
magnet, respectively. Using the relations lı = Rtanay,, l2 = Rtanos and 
a, +a2+6=7, as well as tan A + tan B = (1 — tan A tan B) - tan(A + B), 
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TABLE 7.1: The first order map of the Browne-Buechner 
spectrograph (Exponents in the initial variables x, a, y, b, l, ô) 


Tf af Yf br exponents 

-1.000000 -1.950458 0 0 100000 
0 -1.000000 0 0 010000 
0 0 1.000000 00 001000 
0 0 1.830747 1.000000 000100 
0 0 0 0 000010 
0.519441 0.506574 0 0 000001 


we have 
lila . 
my = ( + Ig) cos + Bo sin 0 


= R (tan o, + tan oa) cos + (1 — tana, tan o2) sin 6] 
tan (o + a2) + tan 6] 
tan (v — 0) + tan 0]. 


= R(1 — tana; tan a2) cos6 - 


= R(1 — tana; tan a2) cos6 - 
Because of tan (7 — 0) = — tan 0, we obtain the desired result 
mi2 = 0. 


In general, it is not hard to obtain the first order map by hand, and it is 
very easy to obtain it using a computer code; the result is shown in Table 7.1. 
With the typical assumption that the half width D; is 0.25 mm, the resulting 
linear energy resolution is 


1 | (lð) 


R= SE 
LT dian lacs 


= 1000. 


Since all electric and magnetic devices produce nonlinear terms in the map 
called aberrations, their impact on the resolution has to be studied when- 
ever necessary. The nonlinear effects are very important in the case of the 
momentum spectrometers due to their large angular and momentum accep- 
tances. Considering the aberrations, the final width will be a new value Azab 
instead of |(x|z) D;|, which has as an upper bound 


Axay = (2|(|x) Di| + |(xlx”)|D7 + K(zlxa) Di Ai] +-+), 


where A; is the half width of the spread in the quantity a. So, the actual 
resolution Rap is 


Roy = Kel 
Axa 
A parameter often used as a comprehensive quality indicator for a spec- 
trometer is the so-called Q value 


Q In (Pmax/Pmin) 


qu In 2 f 
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E 


FIGURE 7.5: Sketch of a generic spectrograph consisting of a single dipole, 
including rays that show the imaging condition and the dispersion of the 
device. 


where Q is the nominal solid angle from which a reasonable resolution is 
expected. The Q-value shows both the geometric and momentum accep- 
tance of a spectrometer. For example, the Q and Pmax/Pmin for the Browne- 
Buechner spectrometer are 0.4 msr and 1.5, respectively. Large Q translates 
into high intensity, which is important for nuclear studies and other situa- 
tions where the number of available particles are small. Large momentum 
acceptance can reduce the number of exposures to cover a certain momentum 
range. 

As discussed before, the purpose of the spectrograph is to translate energy 
information into position information, and in order to have high resolution, 
the position should not depend on anything else if possible. Rays originate 
from a source, travel through the spectrograph, and finally reach the screen, 
as shown in Fig. 7.5. 

It is possible to measure energies in terms of final positions by making the 
dispersion 


(z|ô) 


large. In practice this requires the use of at least one bending element, 
because all other elements have vanishing (x|). The final position should not 
depend on anything else besides 6, and since it is important to be able to 
accept rays covering a wide range of angles, it is necessary to have 


(v|a) = (y|b) = 0. 


So the spot size is limited by (x|r) = 1/(a|a), which is usually kept small, 
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= 


FIGURE 7.6: Sketch of a generic spectrograph as in the previous picture, 
but now subject to aberrations up to order seven. 


and the size of the object, the x size of which is usually kept in the range of 
fractions of mm. 

Any contribution to the final position should be due to energy, and so aber- 
rations depending on initial angle should be avoided. These aberrations are 
usually called spherical, since they historically first manifested themselves 
from the grinding of lens surfaces as spheres, which is much easier to achieve 
than other shapes. So if possible we want 


(z|aa) = (x|aaa) — 0,  (x|bb) = (x|abb) = 0. 


These conditions are not satisfied for the simple spectrograph shown in Fig. 
7.5. Rather, when they are considered, the trajectories of the rays look like in 
Fig. 7.6, showing very noticeable broadening of the image due to aberrations. 

The aberrations involving also x-positions are less significant as positions 
are kept small. The ones involving also y positions are more important as y 
is not necessarily kept small; but if (y|y) is kept large enough, particles with 
significant initial y reach the focal plane with significant final y; the interplay 
of (y|y) and (x|yy) then leads to a parabolic shape of the resulting image, but 
the sharpness of the parabola, which determines the resolution, is unaffected 
by (alyy). 

It is also important to consider aberrations involving energy. Of these, the 
terms depending only on energy of the form 


(z|ó^*) 


do not necessarily have to be corrected as long as they are known, since they 
just turn the relationship of final x and initial 6 into a nonlinear one, which 
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FIGURE 7.7: The effect of the aberration (x|aó). 


still allows an accurate measurement of 6. The most important aberrations 
are usually those that involve initial angles and energies simultaneously, as 
both of these can be large. Of these, the lowest order aberration (x|aó) can 
be corrected by a simple tilt of the focal plane: the final x of a particle, 
which depends mostly on 6, is moved up or down linearly depending on the 
value of a. As shown in Fig. 7.7, similar to before, all these rays with different 
values of a go through a common point at a distance before or after the x 
plane, where the effect of (z|aó) does not manifest itself. The tilt of the focal 
plane is also very clearly visible in the actual example of Fig. 7.6. 

In a similar way, spectrographs can also be used to measure masses of 
particles, and all previous arguments remain valid if the energy deviation ó 
is replaced by the mass deviation ôm. If mass resolution is to be achieved to 
very high precision and the initial energy is not uniform, then in addition to 
the above requirements, it is also important that the final position does not 
depend on 6; this requires that 


(v0) = 0, 
while of course at the same time trying to have 
(2|bm) 


large. The simultaneous satisfaction of these conditions is not possible using 
only magnetic devices; for low energies, it is usually achieved by combining 
magnetic and electric deflectors. 


7.3.1 Aberrations and Correction 


For all the spectrographs mentioned above, nonlinear effects have always 
been a concern for the designers. Looking back to the Browne-Buechner spec- 
trograph, the linear energy resolution obtained was ~ 1000. When aberrations 
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FIGURE 7.8: A magnified drawing of the effect of the aberration (x|aó). 
(Az, xı) is the image on the tilted focal plane caused by the term (x|aó). 


are considered, the resolution of the eighth order drops sharply to around 60, 
which is far below the actually achieved resolution. This shows the importance 
of the aberrations. They have to be studied carefully, and the prominent ones 
have to be corrected. 

Since the entrance slit D; is usually small and the solid angle large, only 
the angle and dispersion aberrations are important. Both map calculations 
and geometric considerations show that all the terms (r|z"*a") (m + n even) 
vanish. Since (|b?) is small (see Table 7.2), the only second order term 
that has a strong impact is (z|aó) which can be as large as 8 mm when a = 
40 mrad and ó — 2096. In fact, this is the most important factor that causes 
the decrease of the resolution. This becomes apparent when the resolution 
considering (z|ad) is calculated: 

(216) 
dut e ES R 
Fortunately (x|aó) is easy to correct, because it only causes the tilt of the 
focal plane, which is illustrated in Fig. 7.8. 

To prove the last statement, suppose a particle of energy Ko(1 + ô) starts 
from the origin with slope ao and goes through an angle focusing system. The 
final position and angle to first order are 


xı = (x|ô)ð, aı = (aja)ao + (a|9)8, 
respectively. Taking into account (z|aó), the result becomes 
Tı = (x|0)ó + (z|aó)aoó, G1 = (a|a)ao + (ald)o. 


Consequently, the system is not focusing anymore. Now consider a second 
particle of the same energy starting from the same point but with a different 
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TABLE 7.2: Aberrations of the Browne-Buechner 
spectrograph at the straight focal plane that are 10 wm or 
larger (Parameters: r,,4,; = 0.23 mm, amaz = 40 mrad, 
Ymax = | mM, bmaz = 10 mrad, maz = 20%. Exponents: 
in the initial variables x, a, y, b, 1,6) 


Coefficient Order | Exponents 
-0.2300000000000000e-3 0 0 
0.1038881145421565 

0.4660477149345817e-4 
0.8311049163372523e-2 
-0.5127000000000000e-4 
-0.1025577239401090e-1 
-0.6562560000000000e-4 
0.3324419665349009e-3 
-0.1873001321765092e-2 
0.1038881145421565e-4 
.1544932687715805e-2 
0.4654187531488613e-4 
-0.1338622665642800e-3 
0.4380445064894873e-3 
-0.2577160791614789e-3 
0.4622130002260186e-4 
-0.9970354551245065e-4 
0.4511716453676560e-4 
0.2219821460952441e-4 


BPRRPRPRPRPRPRP RE 
ODNANTNRWNKFPOWO ANDO AUAN BH 
o 

OonnnFrAPAPPWWWWWNNNNP RK 
OOOoOooo0oo00000000o0ooneconwe 
FPOFRFNORPNWODRFNWODRKRDOOO 
OOOoOoo0o0o000000000000 
OOOoOOooooootK€ooooctM€ooo 
OOOoOoooo0oo0oo0o0o0o0o0ooooooco 
c1014» C0) uo N00 Lr Nf oN OoOorrreo 


angle ao + Aao. The differences in final position and angle between the two 
particles are 


Az; = (z|aó)Aagó, Aa, = (ala)Aao. 


The fact that Aw,/Aa, is independent of Aao indicates that particles of 
energy Ko(1 + ô) are focusing at 


which is proportional to 6. So the tilting angle is 


| Az — (mrjaó) 
tang cet t Salir)" 


where w is the angle between the normal to the focal plane and the z-axis. 
Furthermore, the correction of (r|aó) even increases the resolution under 
certain circumstances. When Azap is smaller than the detector resolution 


Aq, Azq becomes the limitation of the momentum resolution and is inde- 
pendent of v. Since the distance between two peaks increases by a factor of 
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TABLE 7.3:  Aberrations of the Browne-Buechner 
spectrograph at the tilted focal plane that are 10 wm or 
larger (Parameters: r,,4,; = 0.23 mm, amaz = 40 mrad, 
Ymax = | mM, 065,44 = 10 mrad, maxz = 20%. Exponents: 
in the initial variables x,a, y, b, 1,6) 


Coefficient Order | Exponents 
-0.2300000000000000e-3 0 0 
0.1038881145421565 

-0.9320954298691634e-4 
-0.5127000000000000e-4 
0.1079501821087351e-1 
-0.6562560000000000e-4 
0.3324419665349009e-3 
-0.2105079060488440e-3 
-0.1038881145421566e-4 
.1654199766297046e-2 
-0.1329767866139604e-4 
0.5138469075870278e-4 
-0.2242022047923136e-4 
0.1745846601755523e-3 
0.1305877080464639e-4 
0.2771149616039620e-4 


BPRPRRPRPRRE 
O) O1 i45 Q) IQ. 5 O (0 00 HO) O1 i5 0 N EE 
o 

Oros dS d 0oQgoogoNNNMubmrÁóe 
OOoOoooooooooooeoc 
OCNOHENWOOKENWODDCSO 
OOOoOoooooooooooo 
OOOOoOoo0oNvoooovoo 
OOOOoOoooooooooooo 
aow BRPWNEWENBFONOHHO 


1/ cos% while Avg remains unchanged, the resolution is 


which is greater than the linear resolution. Rigorous computation of the actual 
resolution requires that the aberrations on the tilted focal plane be calculated. 

For the Browne-Buechner spectrograph, eighth order maps of both straight 
and tilted focal planes are computed. Table 7.2 shows the aberrations on the 
straight focal plane, where (x|aó) is clearly the major contributor. Table 7.3 
contains the aberrations on the tilted focal plane, where (x|aó) vanishes and 
others are either reduced or unchanged. Both tables are taken from [5]. The 
resolution after the cancellation of (a|ad) bounces back to 780 (or 1560 in 
momentum), which is quite close to the linear resolution. This entails that 
the remaining aberrations are weak enough to be ignored at the time when it 
was first designed. 

When higher resolution is required, more aberrations have to be corrected. 
Usually (x|a?) and (x|b?) are corrected first. Then (z|a?), (z|a26) and (zlab?) 
are tackled. If necessary, fourth order terms like (x|a*) also have to be min- 
imized. At least in one instance, eighth order terms, such as (x|a?0?), are 
corrected, too. This is done in Quad-Dipole-Quad (QDQ) spectrographs. The 
pole faces of the last quadrupole are shaped to produce quadrupole, sextupole, 
octupole and decapole field components. Generally, corrections are done by 
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placing magnetic multipoles of the same order into the system. They are ei- 
ther separate adjustable multipoles or combined fixed elements to dipoles or 
quadrupoles. 


7.3.2 Energy Loss On-Line Isotope Separators 


As part of the growing developments in the study of radioactive beams, on— 
line isotope separation is more and more widely performed. As in other 
mass spectrometers described above, different isotopes have to be laterally 
separated. Yet they can no longer be bent by electrostatic sectors anymore 
due to their high energy. Among the different methods, the energy loss method 
is a very interesting one. First, particles of the right rigidity are selected by 
a slit at the dispersive focal point. Second, the selected particles are sent 
through an energy degrader which creates new momentum spread according 
to the mass of the particles. And finally, a second slit picks up the desired 
nuclei. The best spatial separation can be achieved when the whole beamline 
is achromatic and the degrader preserves the achromaticity. This is due to 
the fact that nuclei of the same mass but different momentum are focused at 
the same spot. Hence it is important to study the transfer map of an energy 
degrader and the achromatic conditions. 

In most of the cases, the degrader is thin enough for us to neglect the 
straggling effect from multiple scattering. So a degrader is the combination 
of a drift and an energy loss, which has the first order matrix 


1 d 0 
M,- 0 1 0 
(d|xz)a (dla)a (9|0)a 


Here d is the thickness of the degrader. It is easy to show that the spatial 
part of Ma can be reduced to a unity matrix. After applying a negative drift 
behind the degrader, M4 becomes 


. 1 -d 0 1 d 0 1 0 0 
Ma={0 10 0 1 o ies e 20 1 0 
0 0 1/ X(0|z)a (3]a)a (616)a (ó|v)a (la)a (00)a 


For heavy ions at the intermediate energy region of > 10 MeV/u, the energy 
at the exit can be described with the formula 


d l/y 
x - x (1-2) 


where K is the energy at the entrance of the degrader and R is the range. 
Furthermore, 


REA CURVAS 
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FIGURE 7.9: Layout of an example of a fragment separator. There is 
an intermediate image at the mirror symmetry plane in the middle, and the 
system is achromatic. 


where A is the atomic number, Z is the charge, and k and y are constants. 
Within this model, the matrix elements can be obtained as 


(Bl) -1 (5:) 
Z)g = ——————— | — 3 
^^ SRo( — do/ Ro) \ Ox) vo 
(dla)a = 0, 
1 
ó|ó)g = ———— 
(514 = oR 
where do and Ro are values for the reference particle. 
To achieve achromaticity, the system has to satisfy 


(zla) = 0 and (a|d) = 0. 


By denoting the parts before and after the degrader with subscript 1 and 2, 
respectively, the achromatic conditions are obtained and they are 


(z|x)2(z|a)ı + (a]a)2(ala)i + (8) (C9 ]ar)a(]a)1 + (a)a(ala)i} = 0, 
(|a) (]ó)1 + (x]a)2(alð)ı + (215) 2{ 6] @)a(@ld)1 + (0]a)a(a9)1 + (616 )a} = 0. 


When the previous model applies, (ó|a) vanishes. Together with the require- 
ment that both parts are focusing, that is (|a); = (x|a)a = 0, the conditions 
can be reduced to 


(z|a)i = (v]a)a = 0, 
Dı Mə + Do(Di(ó|x)a + (6|6)a) = 0, (7.2) 
where D = (z|ó) and M = (a|z). 


An example which uses an achromatic degrader for isotope separation is 
shown in Fig. 7.9. When operated on the achromatic mode, the system is 
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mirror symmetric about the degrader and forms a dispersive image at that 
point. The matrices of the two parts are 


. M O0 D . 1/M, 0 -Dj/Mi 
M4 = (a|z)1 1/Mi 0 ; M» = (a|z)i M4 — (a|z)1Di1 
0 0 1 0 0 1 


From eq. (7.2), the achromatic condition for the fragment separator is 
Dj(0|x)a + (6]0)a = 1. 


Therefore the shape of the degrader can be decided, which is a wedge with 


the slope 

Od u ydo 

Ox i Di i 
It is straightforward to verify that the system with the degrader is indeed 
achromatic 


(8lô)a 0 0 
M = Mo- Ma: Mi = | Mi(alz)i(2— (d|x)aD1) 1 0 
M,(6|x)a 0 1 


7.4 *Electron Microscopes and Their Correction 


The field of electron optics is one of the oldest branches of beam physics, 
which is a direct descendant of light optics. Recently, it is also one of the most 
active branches due to the advancement of aberration correction in electron 
microscopes. In the past decade, the TEAM project (Transmission Electron 
Aberration-corrected Microscope) has been developing the next generation of 
electron microscopes. Initial experiments using the latest aberration-corrected 
scanning transmission electron microscope (STEM) demonstrated the scien- 
tific potential of aberration-corrected electron microscopes. 

Since their invention in the early 1930s, electron microscopes have been used 
in various areas ranging from scientific research to industrial production, and 
various different types of microscopes were developed for the specific needs 
of those applications. The main variants are the transmission electron micro- 
scope (TEM), the scanning transmission electron microscope (STEM), the 
photoemission electron microscope (PEEM), the low energy electron micro- 
scope (LEEM) and the scanning electron microscope (SEM). 

Among them, TEM and STEM are used mainly to study the bulk proper- 
ties of materials with electron energy ranges from 100 keV to 1 MeV. PEEM, 
LEEM and SEM are used to study surface properties of materials with elec- 
tron energy below 30 keV. In a PEEM, secondary electrons generated by 
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FIGURE 7.10: Spherical (top) and chromatic (bottom) aberrations. A 
Gaussian image exists at the dashed line. Higher and lower energy electrons 
are represented by AK/K > 0 and AK/K <0. 


photons are imaged. In a LEEM, electrons reflected from the sample surface 
are imaged. In a SEM, an electron probe the size of a few angstroms is formed 
on the sample and secondary electrons are collected. 

Except for LEEM, which needs à magnetic separator to separate the in- 
coming and reflected electron beams, most microscopes without aberration 
correction consist of so-called round lenses only. There are two types of 
round lenses used in electron microscopes: the electrostatic and the magnetic 
lenses. Electrostatic lenses are used in PEEMs, LEEMs and some SEMs, 
whereas magnetic lenses are used in TEM and STEM where the higher 
energies of the electrons make the use of electrostatic lenses impractical. 

The rotational symmetry of these lenses ensures that only a small number 
of aberrations remain to degrade the linear or, so-called, Gaussian image 
properties. The first kind are the spherical aberrations, which lead to 
a blurring of the image due to the opening angle of the electron beam at 
the object. In the electrostatic case, the first relevant terms are (x,aaa) 
and (y,bbb), which are equal due to the rotational symmetry, and usually 
denoted by Cs. There are also terms of the form (x,a) and (y, 0?) which 
are denoted by Cs, the significance of which is discussed below. Second and 
fourth order terms and cross terms like (x, aab), etc., vanish because of the 
mirror symmetry. 

In the magnetic case the situation is a bit more complicated since magnetic 
round lenses can rotate the image in the x — y plane. However, considering 
the motion in the rotated coordinate system, it can be seen that the matter 
reduces to quite the same situation as in the electrostatic case. The top picture 
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of Fig. 7.10 illustrates the effect of the spherical aberration. 

The second kind of aberrations are the chromatic aberrations, which 
arise as a combination of the opening angle and the energy spread of the 
beam. The lowest order chromatic aberration for a round lens is 2, and in 
the electrostatic case has the form (a,ad), which because of symmetry also 
equals (y, bó). These chromatic aberrations are usually denoted by Cc. In the 
magnetic case after transformation into the appropriate rotated coordinate 
system, the situation is again the same. The effect of the chromatic aberration 
is illustrated in the bottom picture of Fig. 7.10. 

Even in the early days of electron microscopes, the possibility of correcting 
the remaining aberrations had been contemplated. Yet the initial result of 
theoretical investigation was not very encouraging. Scherzer [62] showed that, 
for a round lens without reflection, the spherical and the chromatic aberrations 
do not change sign, the same as the focusing force of such a lens (see Section 
4.4.1). 

Specifically, electrons with a larger angle are focused stronger and electrons 
with higher energy are focused weaker. As a result, aberration correction 
requires violation of the above assumptions, through using either multipole 
elements, electron mirrors or time varying fields. Early attempts on aberra- 
tion correction, between the late 1940s and the early 1990s, failed mainly due 
to insurmountable technical difficulties. Hence the development of electron 
microscopes up to the early 1990s follows mainly the line of aberration re- 
duction through optimization of the lens design and improvement of stability. 
The initial success of aberration correction came when the technology was 
ready in the mid-1990s [59, 39]. 


7.4.1 Aberration Correction in SEM, STEM and TEM 


The first successful aberration correction was reported in 1995, where the 
spherical aberration Cs and chromatic aberration Co were corrected in a low 
voltage SEM (scanning electron microscope). The corrector consists of four 
multipole elements (see Fig. 7.11), which was originally proposed in the early 
1960s. The two outer elements are electrostatic multipoles and the two inner 
ones are superimposed electrostatic and magnetic multipoles. The corrector 
consists of two identical quadrupole doublets, where the two quadrupoles 
are physically the same, excited at the same current but with opposite polarity. 

Furthermore, it is arranged such that the so-called cosine-like ray of the 
horizontal plane, which in conventional transfer map terminology corresponds 
to the (x, x) matrix element, goes through the center of the left inner element, 
while that of the vertical plane goes through the center of the right inner 
element. This entails that (x|aó) and (y|bó) can be corrected independently 
from each other. 

More importantly, rays in the vertical plane coincide with those in the 
horizontal plane going backwards. This layout minimizes the breaking of ro- 
tational symmetry due to the introduction of multipoles. The most noticeable 
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FIGURE 7.11: Cosine-like rays (left) and sine-like rays (right) of the 
quadruplet corrector. 


FIGURE 7.12: A quadrupole-octupole C's corrector. The rectangles rep- 
resent quadrupoles and the hexagons represent octupoles. 


consequence is that the terms (x|aó) and (y|bó) are equal, restoring the rota- 
tional symmetry of the chromatic aberration. The superimposed electrostatic 
and magnetic quadrupoles form first order Wien filters that can correct 
chromatic aberration. 

In other words, the different energy dependencies of the electrostatic and the 
magnetic forces allow adjusting the chromatic aberration while maintaining 
overall linear focusing. In addition, the rotational symmetry of the spherical 
aberration is partially restored. Specifically, for a rotational symmetric sys- 
tem, we have (z|a?) = (v|ab?) = (y|a?b) = (y|b?). For the present corrector, 
the relations among the four terms are (z|a?) = (y|b?) and (a|ab”) = (y|a?b). 
'These relations show that two families of octupoles are needed to correct the 
spherical aberration using a corrector with the same symmetry. 

For this corrector, the octupole components of the inner multipoles correct 
the terms (z|a?) and (y|b?), and those of the outer ones correct (z|ab?) and 
(yla?b). With Cs and Cc corrected, the resolution of a 1 keV SEM could 
reach below 2 nm. 

Meanwhile, another scheme was developed and used to successfully correct 
third order spherical aberration in a 100 keV STEM where a certain combi- 
nation of quadrupoles and octupoles is used. It uses a similar layout for the 
quadrupoles as the Cs and Cc corrector above, which is shown in Fig. 7.12. 
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The linear optics consists of two identical quadrupole doublets with equal 
spacing between the quadrupoles and equal strength of all quadrupoles. The 
two outer octupoles correct the terms (x|a?) and (y|b?), and the middle one 
corrects (z|ab?) and (y|a?b). Due to the large difference in transverse position 
of the horizontal and vertical rays in the outer octupoles, the two knobs are 
mostly orthogonal. A resolution of 0.78 A has been achieved using such a 
corrector. 

While the introduction of the C's corrector into an electron microscope 
corrects the third order spherical aberration, it also generates much larger 
fifth order spherical aberrations (C5) through the combination of the 
objective lens and the octupoles and that among the octupoles, which becomes 
the limiting factor as the resolution reaches toward 0.5 A. The equation below 
illustrates the origin of Cs through combination. 


eee as oe) 
E + ele) 
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(|a) koi? 
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Since C5 is proportional to (x|a), it vanishes when (r|a) vanishes, i.e., when 
the first element (right) is imaged onto the second one (left). It can be seen 
from Fig. 7.12 that this condition is not met for this corrector. More recent 
designs of Cg correctors have taken this into account and correct Cs as well. 
By adjusting the image location, the value of C5 can be varied and canceled. 

In order to correct Cs in a transmission electron microscope (TEM), extra 
attention has to be paid to maintaining a large usable object area, the so-called 
field of view. This usually requires that at least 2000 image points are well 
resolved in one dimension. It turns out that the simple quadrupole-octupole 
corrector shown in Figs. 7.11 and 7.12 does not meet this requirement. The 
main reason is that the cosine-like ray of the objective lens, i.e., the sine-like 
ray of the corrector, goes through the octupoles at large amplitude. As a 
result, it is deflected by the octupoles, generating large aberrations that limit 
the field of view. 

The first successful corrector Cs on TEM consists of two round lenses and 
two sextupoles, which is shown in Fig. 7.13. The round lenses form a so- 
called —I transport between the centers of the sextupoles, i.e., the linear 
transfer matrix is a negative identity. It turns out that this cancels the second 
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FIGURE 7.13: A sextupole Cs corrector. The ellipses represent round 
lenses and the rectangles represent sextupoles. 


OL Li Lə Hj, Ls Li He 


FIGURE 7.14: Sine-like and cosine-like rays of a Cs corrected TEM from 
objective lens to the end of the corrector section. OL: objective lens, Li to 
L4 : round lenses, Hı and Hs : sextupoles. 


order aberrations generated by the sextupoles as well as C5 from combination. 
Furthermore, the third order spherical aberration can be corrected due to the 
fact that Cs from the sextupoles, which is proportional to k?, is rotationally 
symmetric and of the opposite sign of that of the round lenses. 

Fig. 7.14 shows such a corrector in a TEM, together with the objective lens 
and the so-called transfer lenses. Note that the cosine-like ray of the objective 
lens goes through the centers of the sextupoles, hence it is unaffected by the 
corrector, helping to maintain the field of view. A slightly modified version of 
such a corrector has also been used to correct C's in STEM. Recently, a STEM 
named TEAM 0.5 (Transmission Electron Aberration-corrected Microscope) 
has achieved the resolution of 0.5 A at 300 keV using such a corrector. 

With the success of correcting the spherical aberration in TEM, scientists 
and engineers in this field have set out to build a TEM that is both Cs and 
Co corrected. Successful as it is, the sextupole corrector is not capable of 
correcting Cc and it is not obvious how to modify the sextupole corrector to 
include Ce correction. As a result, attention has been focused on the option of 
a quadrupole-octupole corrector. After many attempts, Rose [59] developed 
a design which satisfied the requirement and was later adopted by the TEAM 
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FIGURE 7.15: Sine-like and cosine-like rays of the TEAM corrector. El- 
lipses: round transfer lenses, rectangles: multipoles. The focal length of the 
middle elements is half of that of the far outer ones. The ratio of the sine-like 
rays at the middle elements is 5. 


FIGURE 7.16: Sine-like and cosine-like rays of the TEAM microscope from 
objective lens (OL) to the end of the corrector section. The omitted part in 
the middle is the multipole corrector shown in Fig. 7.15. Lı and Lə : adapter 
lenses, O2 : octupole used to cancel (r|ab?) and (y|a?b). 


project and built. 

As shown in Fig. 7.15, the corrector consists of two multipole quintuplets, 
each replacing one sextupole in the sextupole corrector (Fig. 7.13). The mid- 
dle element of each quintuplet is a superimposed electrostatic and magnetic 
multipole which is responsible for correcting the spherical and the chromatic 
aberrations. Each quintuplet is mirror symmetric about its center and each 
half is again mirror symmetric about its own center. Each half of the quin- 
tuplet is point to parallel and parallel to point. Each quintuplet is a —I 
transport. The result is the cancellation of a large number of aberrations. 
Since one of the strengths of quadrupole families is a free parameter, it is 
chosen such that the relative difference in the horizontal and vertical beam 
width at the center of the quintuplet is large. As in the case of the sextupole 
corrector for TEM (Fig. 7.14), the cosine-like ray of the objective lens is not 
affected by the aberration corrector (see Fig. 7.16). The second family of 
octupole can be placed either at the center of the corrector or, as shown in 
Fig. 7.16, after the corrector. 
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FIGURE 7.17: Spherical (top) and chromatic (bottom) aberrations of an 
electron mirror, showing the possibility of reversing the sign of that of a regular 
round lens. Vertical dashed line: Gaussian image. Higher and lower energy 
electrons are represented by AK /K > 0 and AK/K < 0, respectively. 


Such a corrector posed unprecedented challenges on technology in terms 
of tolerance on alignment errors and power supply stability. The required 
tolerance on alignment error is around 14 jum between adjacent elements, 
which is difficult but achievable. For the superimposed multipole elements 
which are responsible for aberration correction, the noise level of the current 
and voltage supplies have to be below 1.5-1075 (AJ/|I|) and 4-1078 (AU/[U |), 
respectively. This level of stability was unheard of even a few years ago. Yet 
recently, it has been possible to achieve AI/|/| = 8.1 - 107° and AU/|U| = 
3.6 - 107°, fulfilling the design criteria. A first test of the corrector showed 
that the resolution of a TEM with this corrector reached 1 A. 


7.4.2 Aberration Correction in PEEM and LEEM 


Due to the low energy of the electron beam (« 30 keV) in these devices, 
so-called electrostatic lenses that use electric fields to focus the beam are 
feasible. Although the multipole corrector used in low voltage SEM (scanning 
electron microscope) successfully corrected the spherical and the chromatic 
aberrations, it is not suited for PEEM (photoemission electron microscope) 
or LEEM (low energy electron microscope) which requires large field of view. 

A sophisticated multipole corrector similar to the TEAM (Transmission 
Electron Aberration-corrected Microscope) corrector may be sufficient, but 
as it turns out there is a much simpler alternative, namely the electrostatic 
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FIGURE 7.18: Geometry of the tetrode mirror in PEEM3 at Lawrence 
Berkeley National Laboratory, California, USA. 


FIGURE 7.19: Layout of PEEM3. Square: beam separator, ellipses: elec- 
trostatic round lenses, the mirror is on the bottom. 


mirror. The reflection in the mirror makes it possible for a mirror to generate 
spherical and chromatic aberrations with the opposite sign of those from the 
regular round lenses. 

The top picture in Fig. 7.17 shows that an electron with large initial angle 
is reflected at a location where the slope of the field line is smaller than the 
initial angle and can be focused less. The bottom picture shows that an 
electron with higher energy penetrates deeper into the mirror, is reflected at 
a location where the slope of the field line is larger than that for an electron 
with design energy and, as a result, can be focused more. 

Therefore, an electron mirror with a dent on the reflection electrode com- 
parable to the electron beam size can form the desired field distribution for 
aberration correction. Fig. 7.18 shows an example in PEEMS3 at Lawrence 
Berkeley National Laboratory, California, USA, which is an adaptation of the 
SMART design, and the layout of PEEMS3 is shown in Fig. 7.19. SMART 
is a project of SpectroMicroscopy for All Relevant Techniques in Germany. 
The dots behind the surface denote charge rings used for numerical simula- 
tion. The first electrode from the right physically ends roughly at z — 33 mm. 
'There are four electrodes used to provide tuning of the focal length, the spher- 
ical, and the chromatic aberrations. 
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FIGURE 7.20: The beam separator of the first aberration-corrected 
PEEM. The path of the reference electron is shown by the dash-dotted curve. 
(From H. Rose, Geometrical Charged-Particle Optics, Springer-Verlag, Berlin, 
2nd ed., 2012, © Springer-Verlag Berlin Heidelberg 2009, 2012 [60]. Chap- 
ter 9, Correction of Aberrations, Fig. 9.12, “Cross section of (a) the fourth 
quarter of the beam separator showing the double symmetry of the fields and 
the curved optic axis and of (b) the entire separator. The shaded areas repre- 
sent the regions of the dipole field perpendicular to the pole plates. The sign 
and the strength of the dipole field differ for regions with different shading; 
the dash-dotted curve represents the optic axis.” With kind permission from 
Springer Science and Business Media.) 


Although the electron mirror itself maintains the rotational symmetry, a 
magnetic beam separator is needed to guide the electron beam to the detector 
downstream of the mirror, thus breaking the rotational symmetry of a con- 
ventional PEEM. Consequently, the most challenging part of an aberration- 
corrected PEEM/LEEM is the beam separator whose own aberrations have 
to be small compared to the existing ones. 

The first aberration-corrected PEEM was built at Technische Universitat 
Darmstadt, Germany, in the 1990s and was installed at BESSY II, the Berliner 
Elektronenspeicherring-Gesellschaft fiir Synchrotronstrahlung in 2001. Re- 
cently it achieved a resolution of 3 nm. 

Its layout is similar to that shown in Fig. 7.19 up to the exit of the beam 
separator since the former has an energy filter downstream of the beam sepa- 
rator. The mirror column forms a so-called —/ transport, which ensures that 
the cosine-like ray turns back on the axis and is unaffected, maintaining a 
large field of view. The beam separator shown in Fig. 7.20 is a square magnet 
with 90? bending and three axes of mirror symmetry (0 — 27.5?, 45? and 
62.5? for each pass). The resulting optical system is an achromat with +I 
transport, i.e., its transfer matrix is an identity matrix, and it is free of all 
second order geometrical aberrations. 


The drawback of this separator is the difficulty in building this device to 
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FIGURE 7.21: The PEEM3 beam separator. The square, the ellipses and 
the rectangles represent the magnet, the electrostatic round lenses and the 
electrostatic quadrupoles, respectively. 


FIGURE 7.22: Sine-like, cosine-like and dispersive rays of the PEEM3 
beam separator in the z-z plane (left), and in the y-z plane (right). 


the tight machining tolerance and in tuning it during operation due to the 
complexity and rigidness of the design. The fact that focusing is produced 
primarily by the edges entails that the slope of the grooves and the details of 
the field near the electron path are critical to the quality of the image. 

The selection of high permeability material to fit the field distribution to 
the analytical model leads to magnetic material which is very soft and thus 
difficult to machine. As a result, the second project of an aberration-corrected 
PEEM, built at Lawrence Berkeley National Laboratory, turned to a simpler 
separator design shown in Fig. 7.21, and the rays are shown in Fig. 7.22. 
Since the magnet is a simple 90? sector bend, round lenses, with the help 
of electrostatic quadrupoles, provide the focusing. There is only one axis of 
mirror (0 = 45?) for each pass. The system is a so-called —J transport for 
each pass with no zero dispersion at the end. An achromat is formed after 
two passes. 
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FIGURE 7.23: Aberration-corrected and energy-filtered LEEM at IBM. 
Images of the sample are formed on the diagonal lines of the prisms, which are 
shown by squares. (Reprinted from Ultramicroscopy, v. 110, R. M. Tromp, et. 
al., A new aberration-corrected, energy-filtered LEEM / PEEM instrument. 
I. Principles and design, p. 852-861, Copyright (2010), with permission from 
Elsevier [68].) 


Between 2007 and 2011, a third aberration-corrected PEEM/LEEM was 
designed and built at IBM [68]). The layout is shown in Fig. 7.23. Magnetic 
prisms are represented by squares, and electrostatic and magnetic round lenses 
are represented by ellipses. Fig. 7.23 also shows cosine-like rays (drawn in dark 
gray, denoted “field ray") and sine-like rays (drawn in pale gray, denoted “axial 
ray") from the sample. The beam separator consists of two 90? prism arrays 
and an electrostatic round lens. It restores the double mirror symmetry of 
the first separator but greatly simplifies the design and manufacture by using 
only commercially available components. The prism behaves like a round lens 
to the first order and is mirror symmetric, which entails that the dispersive 
ray forms a virtual image at the center. As a result, one round lens between 
the two prisms is sufficient to make the separator an achromat and transfer 
the image from the center of the first prism to that of the second one. 
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Chapter 8 


The Periodic Transport 


In the case of the periodic transport over long distances, the desire is not 
so much to give a special shape to the beam as the beam exits, but, even 
much more simply, to just contain the beam. This is of key importance 
in all devices in which the beam repeatedly passes through the same (or a 
very similar) structure. We may wonder whether this again translates into 
the requirement that a certain matrix element vanish, but as we shall see, this 
is not quite the case. 

Actually it is rather straightforward to formulate a necessary condition on 
the linear matrix: it is not allowed to have any eigenvalue of magnitude greater 
than unity. If the eigenvalue is real, the argument is simple: if this were the 
case, any particle that has its coordinates lined up with the corresponding 
real eigenvector will after one period end up on the same line, but all its 
coordinates would have increased by a factor equal to the eigenvalue. 

If on the other hand the eigenvalue is complex, there is another eigenvalue 
that is conjugate and hence has the same magnitude. Similar to the eigenval- 
ues, also the eigenvectors are conjugates of each other. Now simply consider 
the sum of the two eigenvectors, which is real; sending this sum through the 
matrix multiplies the first eigenvector by the first eigenvalue, and the second 
one by the conjugate, resulting in a sum that is again real and increased in 
size by the magnitudes of the eigenvalues. 

In both cases, coordinates grow exponentially in time, and so eigenval- 
ues that are even only a tiny amount above unity in magnitude are detrimen- 
tal. Of course the nonlinear effects also influence the motion and break the 
purely exponential pattern, but all experience shows that it is not possible 
to correct linear instability with nonlinear means; in practice, things usually 
work quite to the contrary. 


8.1 The Transversal Motion 
8.1.1 The Eigenvalues 


Because of emittance preservation due to Liouville's theorem, the fact that 
eigenvalues greater than unity are prohibited means that, in fact, all eigen- 
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FIGURE 8.1: Motion in phase space (left) and in the eigenspace (right) 
when |tr M| > 2. 


values have to have unit magnitude. Of these, the cases +1 and —1 are 
to be excluded too, since even the slightest imperfection in the machine may 
otherwise lead to instability. Altogether, in a periodic system, the eigenvalues 
must all be complex and of unit magnitude. 

It is particularly interesting to study the special case of a matrix with 
midplane symmetry. In this case, the x and y motion decouple and can be 
described by individual matrices. We obtain for the eigenvalues for the 2 x 2 
x sub-matrix, noting that the y sub-matrix is treated similarly: 


(rjr) -À (ala) 
(alm) | (aja) = À 
= (z|x) (ala) — (ala) (alz) — ^ [(x|x) + (ala)] +”, 


0=|M- All = 


and so 


Hence to have complex eigenvalues requires the very simple condition 
—2 « tr M <2. 


A quick check of the four cases shows that this excludes the point-to-point 
case and the parallel-to-parallel case, as in both of these, the trace just equals 
two or exceeds two. The parallel-to-point or point-to-parallel case each have 
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FIGURE 8.2: Relation between the phase space variables and the eigen- 
vectors. 


one element on the diagonal vanish, so they are permissible if the remaining 
diagonal matrix element is less than two in magnitude. 

We also verify that for tr M Æ 2, the eigenvalues form a reciprocal pair, 
i.e., AVAg = 1. Let us quickly revisit the case | tr M| > 2, for which the 
eigenvalues are real, and hence one of them is greater than unity, and as we 
had already concluded, the motion is unstable. Choosing a new basis, the 
so-called normal form basis, along the real eigenvectors vı and U2, we have 
that the repetitive motion asymptotically approaches the eigenvector yı with 
eigenvalue greater than unity and becomes larger and larger (see the right 
picture in Fig. 8.1). 

A detailed analysis shows that the motion indeed follows a hyperbola; 
note that A;À2 = 1, and that |A1| > 1 > |A2|. Suppose we have a general 
vector expressed in the basis (71,02) as shown in Fig. 8.2 whose coordinates 
are now o and f (not to be confused with the Twiss parameter), and thus 


fs T a - 
T= @ = av, + vo. 


Applying the transfer matrix, we have 
MZ = aMt, + BM b> = at, + Brod. 


In normal form coordinates, the action of the transfer map is thus given by 


x) = (0 X). 


but since Ag = 1/1, the product of the coordinates stays constant, character- 
istic of the motion along a hyperbola. In Cartesian coordinates, the motion 
looks more complicated as the hyperbolic structure is deformed (see the left 
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FIGURE 8.3: Relation between the eigenvalues when | tr M| < 2. 


picture in Fig. 8.1). For practical purposes, this case is unstable and hence 
useless. 

Let us now consider the case |tr M| < 2 in more detail. We have the 
complex eigenvalues that satisfy 


À2 = M and AQ = AS 


So in the complex plane, A; and A; lie on a circle and form conjugate pairs, 
as shown in Fig. 8.3. 
The eigenvalues can hence be written as 


c ost 
À1,2 =e ; 


where p is called the tune of the system, which is 


(> + &) (s 
jt = arccos = = arccos 2 . 


The eigenvectors vj,» belonging to A1,» also form conjugate pairs, since 


Mv M 05 A205 A109. 


Define now two new basis vectors V, = Re (v1), J. = Im (t1) as the real 
and imaginary parts of the eigenvalues; they define what is called the normal 
form basis for stable motion. So we have 


We now observe 


MV, —AX10 = e (U, + iU.) 9 cosu-V, —sinp-.. +i (sin p: U, + cosu: U), 
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FIGURE 8.4: Motion in phase space (left) and in the eigenspace (right) 
when | tr M| < 2. 


and similarly 
Ma = Aaa = e" (U, — iU.) = cos 0, —sin pU —i (sin y: V4 + cosu: U). 


Now assume we have a general vector expressed in the basis vectors 04 with 
coefficients a and f, i.e., X = ov, + BU... Then we have 


Mz = aM@, BM. 
= np t® | gy b= e _ Mi + Mie , pita - Mi 
2 21 2 20 
=a (cos u: 9, — sin p- 0.) + 8 (sin u- 04 + cos u - V) 
(acos u + B sin y) V4. + (Casin u + B cos u) V... 


II 


So altogether, in normal form coordinates, we have 


^ 3 z ( oW cw E ( COS Li A t3 
M = : = : 
B —asin u + B cos u —sinp cos, B)’ 
and thus the transformation M simply performs a rotation as shown in the 
right picture in Fig. 8.4. 

The angle of the rotation in normal form coordinates is simply equal to the 
tune u; and it is completely obvious that the motion is stable. 

To obtain the motion in the original Cartesian coordinates, we have to 
subject the circles to a linear transformation, which turns them into ellipses; 
so the motion looks as in the left picture in Fig. 8.4. The angle by which 
particles move in the original x, a coordinates is not necessarily u anymore; but 
we can conclude that indeed if we look at the average angle advance over many 
turns, then this average converges to the tune p, as at least the number of full 
revolutions that were experienced must agree in both coordinate systems. 
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FIGURE 8.5: Possible movement of the eigenvalues under small perturba- 
tion near | tr M| = 2. 


It is also very illuminating to consider what happens if the system is sub- 
jected to some small errors, which in reality of course always appear. If the 
eigenvalues were far enough from unity, even under small errors we still have 
Ay = Az and A, = AD and while the tune may have changed a little, the 
qualitative behavior of stability is totally unaffected. So as long as we main- 
tain that | tr M/2| < 1 is maintained, stability prevails. If on the other hand 
the perturbation is so large that this is violated, the perturbation can lead to 
the loss of stability as shown in Fig. 8.5. 


For the sake of completeness, let us consider further the case of |tr M| — 2. 
In this case, 41,2 = 1, and 


M - +I. 


'This motion is stable; but under the slightest perturbation there is danger of 
becoming unstable, and hence this case is practically useless. 


8.1.2 The Invariant Ellipse 


For many practical purposes it is particularly important to know in detail 
the parameters of the ellipse that is invariant under stable linear motion. For 
this purpose, let 41,3 = e*‘“, and choose the sign of the tune u such that 
sign(u) = sign((r|a)). We then define three parameters o;, 3; and 7; as 


(z|a) — (ala) 


xja alx 
OOF add gee): i (ale) 
sin Lui sin Lu; 
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As we shall prove now, these three parameters describe the invariant ellipse 


via 
T 
x i Yi Qi . T E 
a e 1) (;) l 


where the matrix describing the ellipse is called T. To prove that T is actually 
invariant, we first express the transfer matrix in terms of the parameters. 'To 
this end, we observe that since 


tr M 
2 


Ài2-— 


we have that 
(z|x) + (aja) = tr M = Ay + àz = e" + e-*^ = 2 cos u. 
From the definition of a;, we have (x|z) — (ala) = 2sin p; - a;, and hence 
(a|x) = cos u; + aisin ui, (ala) = cos ui; — a; sin pi. 
On the other hand, from the definitions of 8; and yi, we have 
(zla) = fisinps, (ale) = =y: sin yu, 
and so altogether 


M COS Hi + Q; Sin pi Dj sin pi 
v — yi sin pu COS Hi — Qi SÌN py J ` 


BV ISON Beno aes d 
Z aar 


M = Î cos p; + K sin pi. 


Letting 


we have 


Computing the inverse map of M , we find 
M! = Î cos ui — K sin p, 


where we used |M| = 1, and as a consequence piyi — a? = 1, which we infer 
as follows: 


1= | M] = (cos Hi + Qi sin pi) (cos Hi — Qi sin ki) + Bidi sin? Hi 
= cos? p; + (Biyi — o2) sin? u; = 1 + (—1 + biyi — o2) sin? mi, 


but since u; was not allowed to be zero or m because of our requirement of 
stability, we must have iyi — a? = 1. 
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We now are ready to study whether indeed the ellipse defined above is 
invariant under M. This is the case if whenever a particle satisfies the ellipse 


equation 
x T Yi Qi . x EN 
a Qi B aj ” 


their image under M , which is given by 


also satisfies the ellipse equation. This means that also 


p 
p] GA) vr QI 
a Qi Bi a 
This is the case if and only if 
MT.T.M-T, 


since every ellipse is described by a unique symmetric matrix and MT.T.M 
is indeed symmetric. In order to execute the matrix multiplications necessary, 
we study various matrix products; let 


We then have 


Now we are ready to compute the product MT . T. M. We obtain 
MT.T.M- (Î cos pi + ÉT sinp) T (Î cos pi + Ksin ui) 
= (i COS Lui + KT sin mi) (7 COS pi + Jsin mi) 
— T cos? hi + J sin pi COS LL; — J sin ui COS LL; + T sin? li = T 


which is indeed what we needed to prove. To conclude we remark that there 
is not only one invariant ellipse, but even every ellipse that can be generated 
by stretching or shrinking from the original one is invariant. So altogether, we 
have a nested set of invariant ellipses, and particles will always stay contained 
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a 


2) 


FIGURE 8.6: Stable motion in phase space. 


on the invariant ellipse on which they are originally lying, as shown in Fig. 
8.6. 

The last important question remaining in this section is to put into per- 
spective the parameters of the beam a, 0, y and the parameters a;, Bi, Yi 
describing the invariant ellipse of one turn accelerator. Are these Greek let- 
ters equal, are they related, or do they have nothing to do with each other? 
This is actually a question that often throws off even die-hard accelerator 
physicists, and it is very much worthwhile to understand it in depth. 

Regarding their origin, these two sets of parameters are actually totally 
independent. In fact, one describes some property of an accelerator, and 
the other describes a property of a beam; and of course we can feed any type 
of beam into a given accelerator. 

However, if the goal is to fill the accelerator in the most efficient way, as 
it turns out this is accomplished if the beam's Twiss parameters agree with 
those of the accelerator. In this case, after one revolution the phase space will 
occupy exactly the same area (although the individual particles in it are at 
different positions), as shown in Fig. 8.7. 

On the other hand, if one injects a beam with an ellipse that does not agree 
with the invariant ellipse of the accelerator, then the repetitive behavior of 
the beam ellipse shown solid in Fig. 8.8 is determined by the shaded invariant 
ellipse it touches. 

As we go around the repetitive system repeatedly, the beam ellipse stays 
within the invariant ellipse and touches it, but, depending on the tune, will 
have a different orientation. In fact, if the tune is not rational — something 
desirable for stability reasons — over time even all different orientations will 
occur. If we now want to operate the accelerator, we have to make sure we can 
handle everything inside the invariant ellipse, leading to considerable waste 
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a 


FIGURE 8.7: Illustration of the case where the beam ellipse (dashed) and 
the invariant ellipses (solid) are matched. After each revolution, the beam 
ellipse is exactly reproduced. 


of area. 


So it is best to operate a repetitive system in such a way that the beam 
ellipse is matched to the accelerator’s invariant ellipse, and to avoid mis- 
matching, the so-called beating. 


8.2 Dispersive Effects 
8.2.1 The Periodic Solution 
Let M be the transfer matrix of a periodic cell, 
R (x|x) (|a) (2/6) 


M = | (ala) (ala) (alà) 
0 0 1 


The periodic solution characterized by D, D’ satisfies 
D (z|r) (ala) (zo) V / D 


D' | = | (alz) (ala) (aj) | | D' |. (8.2) 
1 0 0 1 1 
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Q I 


FIGURE 8.8: Behavior of a mismatched beam. 


Thus 
COR O eo 
OE ica). (is) 

when 


1—(alz) -(xa) V - — (x|x) — (ala 
act (1 o) =2- (ala) = (ala) #0. 


Note that when tr M < 2, which is satisfied if only stable motion is considered, 
D, D' are uniquely determined. 


M 1 1—(aja) (ala) (2/6 
(>) — 2 — (zl) = (ala) ( (als) 1- 6) (t 

1 bo — (ala)) (#16) + ae 

2—(a|x) — (ala) \ (1 — (ælļz)) (alô) + (ale) lð) 7” 


From the point of view of the computation, a slightly different form of the 
same result helps to develop an algorithm that can be applied to arbitrary 
order with the help of the Differential Algebraic (DA) technique. Eq. (8.2) 
can be rewritten as 


100 D 0 (a|x) (ala) (a|8) D 
010 D'|+{0) =| (alx) (ala) (ald) D' |, 
000 1 1 0 0 1 1 
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and hence 
D (zlz)—1 (zla) (2/3)\~" (0 
D' |= (alx) (ala) — 1 (ald) 0 
1 0 0 1 1 
Defining 
. 100 
ig2[010], 
001 


we obtain the more compact form 


D 3o visu 
D' IUE) 0 
1 1 


It is straightforward to generalize the above equation to the case of a nonlinear 
map. Here the equation to be solved is 


D (6) D (6) 
D'(6) | =Mo} D'(3) |, 
5 5 


where D (0) and D’ (6) are polynomials of 6 without the constant part. Similar 
to the case for the linear map, we obtain 


D (6) D (6) 
D'(6) | -(M-Zu) o| D'() |, 
ô ô 
where 
100 Ë 
Ty =|010 a 
001 ô 


As described in Section 5.4.2, the map (M — Ig)! can be obtained using the 
Differential Algebraic (DA) technique order-by-order up to any given order. 
Thus the periodic solution of dispersion up to arbitrary order can be obtained 
without loss of accuracy. 


8.2.2 Chromaticity 


Now let us turn our attention to the betatron tune of off-momentum par- 
ticles. We know that higher momentum particles are bent less, hence focal 
length of quadrupoles is longer. As a result, we expect that total phase ad- 
vance decreases as momentum increases. Recall that 

_ 40B; 
~ p Ox’ 
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and when p — po 4- dp, we have 
|. q00B, q OB, 1 


= +- — =, kg (1-56 

p Ox po Ox 1+ dp/po 1 Ko ( ^ 
q OB, dp 
Rire M pu 
po Ox Po 


For a small distance ds, we have 


A i. 26 
E e 3 


^ 1 0 
cm mus 1) | 
we have 


^r apyg- (| 0890) + o sin(uo) B sin(uo) 
DA ( -7sin(uo)  cos(po) — EN) 
_ ( 1 a p + a sin(uo) B sin(uo) ) 
— (k — ko) ds 1 —ysin(uo) cos(uo) — a sin( Ho) 


E ( 1 D Gee + a sin(uo) B sin(uo) ) 


ókods 1 —ysin(LU0) cos(uo) — a sin(uo) 


And by defining 


Using the normalization transformation 


BOI M MS NE. 
o/V/B VB -aj/B 1/VB} 


we obtain the transfer matrix in the normalized space as 


M = ÂM Â`! 


1 0) 4-1. 4 ( eos(uo) + asin(uo) B sin(ug) is 
A 1) aS —ysin(uo) cos( uo) — on) A 


=A 
cos(uo) sin(p0) 
au i —sin(uo) cos(uo) 
= cos(Ho) sin(uo) 
~ VóBkods cos(uo) — sin(ug) cos(uo) + óBkods sin(jo) 


and 


cos(u) = pu (ar) = cos( Ho ) + L ökods sin(uo), 


1 
u = po + du => cos(u) = cos( uo) — du sin( uo) => du = — 39Bkods 


du 1 dv 1 
= — = — — = ——— = —— k d a 
= dv = g 9Pkods => En D aL o(s)ds 
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The last step keeps only the leading order effect. The quantity En is called 
natural chromaticity. 

It will likely lead to beam loss since certain off-energy particles lie on reso- 
nant tunes. To remedy this problem, it is common to use sextupoles, because 
they do not affect tunes of on-momentum particles but at the same time 
provide quadratic nonlinearity that can be used to compensate the natural 
chromaticity. The magnetic field of a sextupole is 


By = b2 (a? — y’) , By = 202xy, 


where bz = —3M3,3. Defining ks = —b2/xmo and denoting 


COS [le +QzSiN [be By SIN [be 0 0 
^ —^ys SIN Hg COS [ly — Oi Sln [be 0 0 
M — . : , 
0 0 COS JL, +QySiN fly By sin ty 
0 0 —Yy SİN fy COS fly — AySIN fly 
we obtain 
X1 X To 
2 2 
ai | _ a+ keds (x - y?) EIE ao 
yi y yo 
by b — 2k,dsxy bo 
x (cos Ha 4- oc; sin fly) zo + (Bx sin Hz) ao 
a+ keds (a? — y?) — (Ye sin Hg) xo + (cos ui — o, sin Hz) ao 
O 
y (cos fly +aysin py) yo + (By sin Hy) bo 
b — 2k,dsxy — (ny sin Ly) yo + (cos fy — a sin ty) bo 


Again, to make the physical picture clearer, let us apply the normalization 
transformation in four-dimensional space, which is 


c l/V/B. 0 0 0 x 
a ET. a Ar / VB vB 0 0 a 
y y 0 0 IB, 0 y 
b b 0 0 ay/V/By V/By/ Nb 
The inverse is 

JV Bx 0 0 0 

Ac E —ag/ V Ba 1// B. 0 0 

7 0 0 a) By 0 

0 0 —oy/ / By 1/ / By 
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Denoting the linear normalization transformation (as opposed to the trans- 
formation matrix), its inverse and the linear transfer map as 


c x B 
A-Á|*|, At=A11"), M-HM|7t|, 
y y y 
b b b 
we obtain 
Ti HH 
a ksds (x? — y? 
n Pot a+ ksds (x? — y?) o A71 o[AoMo A`] 
Uu y 
hb b — 2k,dsry 
x 
a + ksdsy bx Bar” — f y 
= : id ) o R(Hs, Hy), 
b= 2ksdsy Pxbyxy 
where 
Ê(pa) 0 r 
Lex a ^ cosu sin 
R(us; Hy) = | m y |? R (uz) = é i 2 l 
R (py) b 


Since the kick of a sextupole is second order in the coordinates, the off- 
momentum particle has to go through it off the magnetic center in order 
to affect the linear motion and hence the tune. In other words, the dispersion 
has to be nonzero at the location of the sextupole to correct chromaticity. 
With the presence of dispersion, the coordinates are 


+ 
x —>zx-— D,ô, a— a- D,ó, 
and the normalized coordinates are 


à ü — VBD, ô. 
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The one turn map in the new normalized coordinates is 


DN, Jato Da x z— Br? Dô 
x i 
ay _| a+ 82 Dô E at+k,dsBz (Bax? = Byy?) "E HI 
Yı y y y 
i b b — 2k,dsf2 By xy b 
z + 6z? Dzô z— 6z? D,AV (o 
Eg doy a 
o| @+ 82 D,ó oR(us uy) | 57 2D,6 |,| °° 
y y Yo 
b b bo 
zr To 
2 as 
a+ ksdsv/Bs [Br (2 — Daô/ VBE) — Byy?] ag 
2 R(us, My) 9| ~ 
y Yo 
b — 2k,ds /Bs By (x — Dzô/ VBa) y bo 


'The reason that the constant part vanishes after the rotation is that D, and 
D,, are periodic solutions of the ring. Keeping only the linear part, we obtain 
the transfer matrix 


1 


M = 26k,ds | 6zPs 


0 0 
0 0 ^ 
0 1 


oor © 


By D 
and the chromaticities due to the sextupole 
1 
Geo qe Gels) Da(s)he( ss, 
1 
Lys = oae d By(8)Dx(s) bos) 


In summary the total chromaticities are 
fx = EL [ks (s) m D,(s)ks(s)| ds, (8.4) 
fy = ELI [ky (s) + D«(s)ks(s)] ds, (8.5) 


where k,(s) and k,(s) are quadrupole strength in the x and y planes along 
the ring. Usually two families of sextupoles are used to correct chromaticities 
in both planes. In order to make the two knobs more efficient and orthogonal, 
one family is placed at locations so that 6, is large and 8, is small and the 
other family at locations so that the opposite is true. 


The Periodic Transport 205 


Eqs. (8.4) and (8.5) are very useful for the design process to determine 
where the chromaticities are generated and where the best locations are to 
place the sextupoles for correction. Yet computing them together with the 
higher order terms become almost trivial with the Differential Algebraic (DA) 
technique. Recall that the tunes are given by 


il: tr M, y 
Vry = m arccos lc . 


The ô dependent tunes are simply 


which contain the tunes and the chromaticities to arbitrary order. For exam- 
ple, vz, are the constant part and £;,, are the linear coefficients. 


8.3 A Glimpse at Nonlinear Effects 


Linear motion around a fixed point is completely classified by the two cases 
we discussed previously, namely the stable or unstable case. This situation 
is fundamentally different in the nonlinear case; it is in fact much 
more complicated and interesting, and represents one example of the modern 
research field dealing with just such questions. 

While this is not at all the place to try to develop a complete understanding 
of the nonlinear effects that may appear, let us spend some time to stake the 
territory and make some general observations. First we may expect that as 
long as the motion is close enough to the fixed point, it is dominated by 
linear effects, and depending on whether we have stability or not, we see 
either stable elliptic motion or unstable hyperbolic motion. While we may 
expect that linearly unstable motion will in most cases also stay unstable if 
we consider the nonlinear effects, linear stable motion will not usually 
stay nonlinearly stable. In fact, if the amplitudes of the motion become 
large, the effects of nonlinearity will become noticeable over-proportionally, 
and eventually they will become dominating, in most cases leading to insta- 
bility for large amplitudes. 

One can then try to heuristically separate the phase space into a region that 
appears stable for a reasonable number of turns, and a region that appears 
unstable. According to the previous arguments, in most cases the stable 
region will be near the fixed point, and the unstable region will be away from 
the fixed point. The region of transition between the apparently stable and 
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apparently unstable parts is usually called the dynamic aperture, and it 
often looks like a deformed ellipse. 

Let us now study a little what conditions seem to favor stable or unstable 
motion, respectively. If we divide the phase space regions into parts in which 
the nonlinear effects have a tendency to pull particles away from the origin 
and those that tend to push the particles toward the origin, then we may 
expect that we want to avoid situations where the particles spend too much 
time in the “pull away” regions, and it is better if we sample the phase space 
uniformly, and thus average out the effects as much as possible. 

A nearly uniform sampling of the phase space happens if the linear tune is 
not a rational multiple of 27. On the other hand, if the tune is of the form 
Lli = 27p/q, after q turns the particle will come back to where it was before 
and hence can see the same effect, a situation which we call resonance; so 
it is at least not a good idea to choose q too small, as repetition after large 
numbers of turns is not as critical. The effect of resonances in a circular 
accelerator is of great importance to its performance. Chapter 11 is dedicated 
to studying this topic in detail. 

We may also wonder to what extent it is possible to perform a transfor- 
mation to normal form coordinates in a similar manner as in the linear case. 
As it turns out, most systems cannot be brought to a normal form in which 
the motion is exactly circular; the existence of such a transformation is tanta- 
mount to the system being integrable, i.e., having one integral of motion per 
phase space dimension. Truly integrable systems, however, are very rare. It 
turns out, however, there is a powerful order-by-order iterative procedure to 
turn a system into nonlinear normal form up to any given order [5]. A simple 
example of this procedure is given in Section 11.4. 


Chapter 9 


Lattice Modules 


In the design of actual devices for the manipulation of beams, it is impor- 
tant to employ field arrangements that achieve the basic features of steering 
the beam as a whole to its desired location, as well as keeping the beam 
close together over possibly extended distances, which is achieved through 
various focusing mechanisms. Thus in both single pass lines and rings, there 
exist different sections that perform different functions which require differ- 
ent types of lattice modules. Modern accelerators and beam transport lines 
focus the beam transversely using alternate-gradient focusing, also called 
strong focusing, which evolves from the so-called weak focusing used in 
betatrons and early weak focusing synchrotrons. In the weak focusing ma- 
chines, inhomogeneous dipoles were used to bend and transversely confine the 
beam simultaneously. This is possible due to the fact that an inhomogeneous 
dipole with 0 < n < 1 focuses the beam in both x and y planes. 
From eq. (4.6), we have, for 0 < n < 1, 


x | cos(/1 — n) (p/V1—nm) zu) 
"o \ = (VI- n/p) sin(V1— no) cos(VT— nio) | 
ü -( cos( yng) (p/ /m) v 
" X-(Vn/p)sin(vnó) cos(/nó) l 


Yet weak focusing was eventually replaced by strong focusing because weak 
focusing was too weak to confine the high energy beam. Here is an example 
of the Tevatron at Fermi National Accelerator Laboratory (Fermilab, FNAL), 
Illinois, USA. 


Bo ~4T, E= 10° GeV, P = 10? GeV/c. 
P Pe dE 10x10? 


(T CONG dE aea 0 1225 
For n — 1/2, we have, 

ðB, Bo 14 E 

DUM n2 wv 25x 10T 

aea ED a 


Bugs = 2.5 x 107°- 5 x 107? = 1.25 x 1074 T. 
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FIGURE 9.1: Sketch of a FODO cell without bending magnets. 


Yet for quadrupoles, B|,-, ~ 1-3 T. Thus we conclude that weak focusing 
at the 1 TeV of the Tevatron is four orders of magnitude weaker than strong 
focusing. 


9.1 The FODO Cell 


The most common form of strong focusing module is the so-called FODO 
cell, where two quadrupoles of opposite polarity (focusing (F) and defocus- 
ing (D)) are separated by drifts or homogeneous dipole magnets (O). Due 
to it simplicity, we can easily derive the transfer matrix of such a FODO 
cell. To simplify matters even further, we choose the center of the defocusing 
quadrupole as the start and the end of the cell and use the thin lens model of 
the quadrupole. First, let us consider a FODO cell without bending magnets, 
as shown in Fig. 9.1. The total transfer matrix of the horizontal plane is 


te= Gun 1) (0 1) Cs t) Quis 1) 
[Cree IO t) n 2) onn YG t Guns 1) 


= 1— d/2fi d 14+ d/2f5 d 

— V/2f5—1/2fi- d/Afi fo. 14 d/2f2A1/2f5—1/2fi— dAfife 1— d/2fı 

_ 1— d/ fy d/ fa— d? /2 fi f2 2d (1— d/2 f1) 

AWWA A fi- d/fifet d/2 fi PAR fI. 1— d fie d/ fa- 2h} 
Hence we obtain the transverse tunes, which are 


cos (Ur) = 1 Dace g cos ( EE i d 
L2 hf» Afif? oam hf 2hf 
Note the second equation above is obtained through interchanging fı and fo. 
The shaded area in Fig. 9.2 shows the range of fı and fo where motion in 
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-d. 
2fi 


FIGURE 9.2: The “necktie” diagram showing the stability region of a 
FODO cell. 


iQr B QD B iQF 
FIGURE 9.3: Sketch of a FODO cell with bending magnets. 


both planes are stable, i.e., |1—d/f1+d/fo—d?/2fi f| € 1, |V4- d/ fa — d/ f2- 
d? /2 f, f?| < 1. Because of the shape of the region of stability, this and related 
similar figures are often referred to as a necktie diagram. 

When f = fi = f2, we have 


d2 
cos (us) = cos (ty) = 1- zog. 
and the condition for stable motion is 
2 d 
hi <1= -1 <1- a= 0S s aso ped 


From the geometrical point of view, when f < d/2, the particle that is parallel 
to the optical axis at the center of the defocusing quadrupole at the start of 
the cell would cross the axis before the center of the defocusing quadrupole at 
the end and bend further away from the axis. The transfer matrix for f = d/2 
is —I , so the maximum phase advance of a cell is 7. 
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Next, let us consider a FODO cell with bending magnets, as shown in 
Fig. 9.3. Here we assume that the drifts between magnets are negligible. 
Furthermore, the bending angle 0 is small. Since / is nearly a constant, the 
transfer matrix of the dipole is 


, cos 6 psin@ p(1-— cos) 1 l 10/2 
M-—|-(1/p)sinü cos0 sin 0 —-2[01 6 |, 
0 0 1 00 1 


where | = p is the arc length of the dipole and 0/p — l/p? « f {l<2f,p> 
f} 

Note that the factor (1 + no) / (2 + 70) does not appear due to the fact that 
dp/po is used. At an energy that yo > 1, (1 -- o) / (2 - no) S 1, and the 
difference between dp/po and dK/ Ko becomes negligible. The matrix of the 
cell is 


1 0 0X/1 I 16/2 100 1 l 10/2 1 00 


M, =|—1/2f 1 O||0 1 0 1/f10[|[01 8 ||-12f1 0 
(XE 1J/\00 1 Uu 1J/\0 0 d 0 11 
1—42/2f?  9(--i/2f) 210 (1+1/4f) 

=| —l/2f?+/4f? 1—-DPJ/2f? 20(1-1/4f — /8f?) |, (9.1) 
0 0 1 


and hence we obtain 

LM dues usc: 21 [1 + sin( {tz /2)] 
2f? sin(j) 

a, = 0 = > B, — 0, focusing at the ends => f, = bemar: 


cos(u;) = 1 — 


Note that when QF and QD (see Fig. 9.3) have the same strength, there is 
symmetry between the two planes. The transfer matrix of the vertical plane 
can be obtained through changing the sign of the focusing, i.e., (f— > — f). 
Furthermore, the transfer matrix of the cell that starts and ends at the centers 
of the defocusing quadrupoles can be obtained the same way. The results are 
summarized below. 


2l [1 — sin(u/2 
My = Ha, ymas = zmax? yan = Bamin = LL 
Note that //f determines p, and l determines 8, and 8,,, when p is fixed. 
With the upper limit of the magnetic field B of the dipole, which give the 
upper limit of 0, thus the lower limit of the number of cells, the lower limit of 
the size of a ring can be obtained. (The upper limit of B is roughly 1.5 T for 
warm (normal conducting) magnets and 8 T for superconducting ones.) 


min 
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Similarly, we have 
10 [1 + (1/2) sin(u/2)] 


sin? (u/2) 
DL-0, DL-0, 


_ l8 [1 — (1/2) sin(u/2)] 


Dinak = D Dyin = - 2 
sin? (11/2) 


, 


where the subscripts F and D denote the centers of the focusing and defocus- 
ing quadrupoles, respectively. When u = 90°, we have 


l 
= 


Be (2 + v2) EE (2 - v2) l, M —342V2- 5.8, 


Bg 5 (4 n v2) 10, Dmin = 5 (4 E v2) 10. 


As an example, let us consider the Main Injector FODO cell at Fermilab, 
which has the parameters 


l= 17.2886 m, p= 


SIE 


, 


min 


Baas = (2+ V2)1~ 59.0 m, Buin = (2- V2) 101 m, 


and which is shown in Fig. 9.4 [33, 21]. The magnetic field and length at the 
momentum of 8.9 GeV/c are 


B=0.102T, ,=6.096m, ym=29.69Tm, p= » = 291 m, 


2 
0= a = 41.9 mrad (2 magnets per half cell), 
p 


; (4- v2) 10 = 0.94 m. 


1 


Dyax = 
2 


(4+ v2)16 — 196m, Dmin = 
It is worth noting that the values of ba and Dmax are very close to those 
obtained from the exact model, which are 58.2 m and 1.95 m, respectively. 
This shows that the thin lens model is rather accurate for a typical FODO 
cell. 

Now let us look at the size of the beam. The Fermilab Main Injector 
is designed to accelerate proton and anti-proton beams of emittance up to 
407 mm mrad. Following the Fermilab convention, the emittance is defined as 
EN = berms BY, (B = v/c, y = 1/1 — 82), where erms is the rms area of phase 
space occupied by matched beam. The quantity ey is called the normalized 
emittance. The factor Gy makes ey a constant through acceleration — note 


that ey X rms By ox By) (22) (a2) — (xa)? ox 4/ (a2) (p2) — (£p). The factor 


6 means that the size is 60 ~ 2.450, which contains ~ 90% particles. As a 
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FIGURE 9.4: Lattice functions of a FODO cell at the Fermilab Main In- 
jector. 


result, we obtain 


Pimax EN 
Tmax = Ymax = ~ By ` 


At injection, with po = 8.9 GeV/c, we have 7 = 9.54, 8 = 0.994, so we 
have max = Ymax ~ 16 mm. For the horizontal beam size, we have to take 
into account the momentum spread. We assume óp/po ~ 0.3% and have 
tp = Dyaxdp/po = 6 mm. As a result the total horizontal beam size is 
cla. Vinx +7, ~ 17mm. So at injection, the full beam is about 34 mm 
wide. At extraction, the momentum is 150 GeV/c, 6 ~ 1 and y = 160, so 
we have imax = Ymax = 4 mm. Since óp/po scales with y, the momentum 
spread becomes 0.02% and xp = 0.4 mm. Thus, we have zf, ~ 4 mm, and 
the full beam at extraction is 8 mm wide. This is called adiabatic damping. 
With acceleration, all phase space variables scale the same way. As a result, 
the shape of a bunch does not change. It is illuminating to compare radiation 
damping and adiabatic damping. In the case of radiation damping, po remains 
constant while py, p, and K decrease. (The radiated energy is recovered by 
the radio frequency (RF) cavity.) During adiabatic damping, pz, p, and ôK 
remain unchanged, while po increases. Both dipole and quadrupole magnets 
have to be ramped to keep the design closed orbit and tunes constant. The 
stronger force results in a smaller beam. 
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Finally, let us look at chromaticities for FODO cells. 


— 1 E 1 Bmax Pin 
E= -E focos = -Ly ( eae ) 


1N " u 2l 
Ec Paul 7) 
4n f ( a dorem n 
N Asin(u/2), . p N u tan(u/2) Nu 
= — —— — 2 —) = —— t =) = — V1 = —-). 
4r sinp i. T an() n/2 ^' ( on) 


For the Fermilab Main Injector, 


pp =26.425 co = —33.6, 
, Vy = 25.415 => & = —324, 
which is not far from the exact values (£, = —33.6 and £, = —33.9) showing 


again the usefulness of the thin lens model. Without correction, and assuming 
the momentum spread of +1%, we have 


A 

Av, = & P = +33.6 x 0.01 = +0.336, 
Po 
A 

Avy = y— = +32.4 x 0.01 = 40.324. 
Po 


Clearly, without chromaticity correction, the momentum acceptance of the 
ring would be very small (probably below 0.1% due to the fact that the dis- 
tance between v, and the half-integer is only 0.075). To correct chromatic- 
ity, we place two sextupoles in each FODO cell, one next to the focusing 
quadrupole and the other next to the defocusing quadrupole. Using the thin 
lens model of the sextupoles and ignoring the distance between the sextupoles 
and their adjacent quadrupoles, we obtain the total chromaticities from eqs. 
(8.4) and (8.5) 


1 1 1 
= re Bm (5 = Dash) oe Bmin (-5 m Danks )| ? 


& =~ gc d 89 a(s) + Da(s)ke(9)] ds 


1 1 1 
= rd Bin (-5 + Dass) + Bmax G + Duinksn )| , 


where ksp and ksp are integrated strengths of the sextupoles next to the 
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focusing and defocusing quadrupoles, respectively. When 


pr 1 = sin (44/2) 
SU Dae  2f20[1 + sin(u/2)/2]" 
NE 1 _ sin (44/2) 


fDmin —— 2f?0[1 — sin(u/2)/2]" 


the chromaticities are corrected. Note that the relation | = 2f sin(u/2) is 
used to obtain the above expressions. 


9.1.1 The FODO Cell Based Achromat 


Achromats are needed because dispersion free straight sections are needed 
in both circular accelerators and beam transport lines. Achromatic sections 
are also required in circular machines, where, for example, straight sections 
that house injection and extraction kickers are dispersion free to make the 
beam small. The straight section where RF cavities are located is also dis- 
persion free. Passing a RF cavity with z-ó correlation produces coupling 
between transverse and longitudinal motion, which is usually undesirable. 
In the case of beam transport lines, achromatic conditions have to be met 
when the matching requirement is such that the line is imaging or the line is 
isochronous. 

There are mainly two types of achromats, those that utilize repetitive sym- 
metry and those that use mirror symmetry. Let us consider a system that 
consists of n identical cells. 


i (zjx) (xla) (w|8) Ê d 
M = | (a|z) (ala) (ad) | = l 
0 0 1 0 1 
, Rd\(kR d Ê o (R-Id 
E = , 
0 1 0 1 


When Ê” = Î, i.e., p = (m/n)2m, 
M" =. 


which is an achromat. 
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For a stable FODO cell, which is what we are interested in, we can write 
the matrix R explicitly, which is 


R= cos y + asin u sinp 
—*ysinup  cosu— asinu j` 


In the normalized space, 


R= ARA = cos jt sinp 
— siny cosp 


where 


"Xa 
dm mcm 


with d = (x|ó) and d’ = (a|9). As a result, we have 
zE cos (k in (ku) 
m Geeks | uU 
ak xf cos(ku) sin(kp) 
33 B 2. > — dnte cos 2 
n—1l (ete tech) /2 pe —e 3) /2i 
bu = pce /2i (eiu ge gH /2 
B a (A — B) /2i 
—(A- B)/2i (A+B) /2 


where 
n—1 ih n—1 E 
d = H = inu 
A= J eke — : E B= 2 e — enis A 
1— ein? I- erin 
k=0 


It is obvious that when u = (m/n)27, we have m — 0 and B — 0, which leads 
to the achromatic condition. 

Achromats of this kind have been used as the arcs of storage rings, beam- 
lines and spectrometers. Examples are the 90? arcs of the South Hall Ring 
at the MIT-Bates Linear Accelerator Center at Massachusetts Institute of 
Technology, Massachusetts, USA, the 180° arcs of the storage ring at the 
Duke Free Electron Laser Laboratory (DFELL) at Duke University, North 
Carolina, USA, and the arcs of the ILC (International Linear Collider) Beam 
Delivery System at SLAC National Accelerator Laboratory, California, USA. 
There was a time-of-flight spectrometer built at Los Alamos National Labo- 
ratory, New Mexico, USA, that consists of four identical cells with u = 90°. 
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It turns out that this kind of system not only cancels dispersion, but also 
cancels all second order geometrical aberrations. Recalling the equations of 
motion (3.22), second order geometrical aberrations generated in the short 
interval [s,s + ds] can be written as 


zf = £i + hdstidi, ag = ai + S T k mnei AUC UP. 
yp — yi +hdszibi, by — bi +X Tok i mmataly™b?, 


where the summation )> is taken over k,l, m, n from 0 to 2 such that k +1 + 
m +n = 2; so in the above, 
2 


5 reads as 5 


k,l,m,n=0 
k+l+m+n=2 


This simplified description is used in the rest of this section unless otherwise 
noted. Here the linear matrix from s to s+ds has been removed by the inverse 
matrix. As a result, the second order map over the interval is lumped into a 
point. Applying the transformation 


g x /B 0 0 0 x 
a ua aj | Of Al Br */ Bs. 0 0 a 
V| oly]. 0 0 1//By 0 yl 
b b 0 a Bo 7 \b 
we obtain 
qj g/A/ B. x +hdsxa V Bag 
Gf Hu (ou 4-B5a)//Bs | [a +5 Takim az alyor] | Coszi--à;)/vBa 
W|| w/&A | y + hdsab Bj 
by — \(ayytByb)/VBy) \O+ YT kim nr hay") NCoyg- b) By. 


T; + (hds/V/Bz) Ti (—o d; + d;) 
Gi + Y Tu m dal OP 
i + (VBzhds/ 8) ži (—ayiii +È) 
b+ CT us niigi 
where Ty,k,1m,n and Tj, 1,5, are linear combinations Ty,k,t,m,n and Th k tm,n, 


each of which is multiplied by some powers of Br, by, o, and/or oy. Now let 
us consider a system that consists of n identical cells with the phase advances 


is = fly = 2m n. 
Defining 


> 
oe 8 8 
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the sum of all the n kicks over the whole system is 


E, (hds/A/B.) x (—asx + a) 


s |-ma(p-me-t). |, Chainer 

Is oz (em) h- oy (VBzhds/By) £ (ayy +b) 

b, YT, uma Faly"b? 

Ta; + Ox 
5 ts T a 
(hds/V/Bz) x (azz + a) 
_ > r( 2mm /n J , YT. kim nt aly™b” 
mm —2mmn [n — by (V/Bshds/ By) x (—ayy + b) 


SS Tennis a y "tb" 


R 2mm/[n + be 
2mm/n dy)” 


The last step uses the fact that nur = nuy = 27. Every term in the above 
expression can be written in the form of 


n 2mm 2mm 
5 cos! pum + 2) sin? (= + 2 : 
n n 


where | = 0,1,2,3 and j,k = (x, y). Using the relations 


ci 4 et? . gió — e~it 
cos $ = ——— —, sing = ————, 
4 2 2n 2j 
we have 
= 2mm 2mm 
5 cos! (= + 6;) sin?! (= + 2 
TL TL 
m=0 
n—1 gi(2mm [n ó;) zi eg (Qmm /nó;) l gi(2mm [n ó;) _ e iQmn /nó;) 3-l 
i 2 2i 
m=0 


Dropping the common parts in each sum, which are functions of $, and $,, 
there are only four kinds of sums 


n—1 in6r/n n—1 —in6r/n 
X eibmn/n = l-e i =0 X e 6mm [n = l—e =0 
Ea 1 — ei67/n m 1—e-i67/n ? 
m=0 m=0 
n—1 1-— cin2n /n n—1 J= ea in2n [n 


3 gi2 mm /n = E =0, 


m=0 


y» eg mmn n = =a = 0, 


m=0 
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when n Æ 1,3. In conclusion, for a system that consists of n identical cells 
(n > 1, n £ 3) and pe = fy = 2n /n, all second order geometrical aberrations 
vanish. Note that this is true even for systems without midplane symmetry. 
When coupling is present, the second order kicks for x and y will become more 
complicated but remain a polynomial of the second order. The transformation 
to the normalized coordinates A will be coupled as well, yet the general form 
of the kicks in the normalized space remains unchanged. Therefore the same 
proof holds. 


Another result of such a system is that some chromatic aberrations are 
canceled. Of all the remaining chromatic terms, only two are independent. 
With two families of sextupoles, all second order chromatic aberrations can 
be corrected. Thus we obtain a system that is free of all aberrations up to the 
second order, which is called a second order achromat. From the symplectic 
condition, we know that, up to the second order, the path length depends on 
ô only. 


Next, we are going to prove that only two independent families of chromatic 
terms are left. Going back to the equations of motion, the second order 
chromatic terms are 


Tf = Tti F Tr 45046, af — iT Ta,25 X40 T Ta 520°, 


Yf = Yit Ty bóbi ô, br = b; + Toys yið. 


In the normalized space, the map becomes 


Tj z//D. t + Tr aað V Bek: 
ar (oux T bza) [V Bs a+Ta 260d + T, $2 62 (-osdi T Qi) / V/ Bs 
yr y/ A/ By yt Ty,p5bd By 
by) \(ayy + Byb) / Py b+ T) sy (7o +b) / VB, 
z/ / Bs V/ Bs + Tras (arti + ti) / V/ Bs] 6 
(as + Bua) //Bz | | Oxi + Gi) / V Bx + Tuas VBrbi5 + Tu an 
| uym | VB at + Teas (esi) VB] 6 
(ayy + Byb) / VB (-ayii b) [Af By + To us / By io 
Ti I Trad [( OTi t ai) / Be] ô 
ait |(B2Ta z Pr 02T;. a5) /B«] Tið = (o / Bx) Tr addið - / B T4 520? 
Yi + Ty,os (esi b) /& 6 
bi + [(82T5.us ES o2T, v) / By] 9&9 + (ey / By) Ty p5bi6 
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The sum over the system is 


Vr 

T n—1 

af —2mm [n — bx 
t = R 

yr 2. Bim 
br 


Tz,að [(—o2 + a) / Be] 6 
| (62Ta,26 — 02T 2,05) / Bs] £6 + (An /Bx)T2,0505 + / Bs Ta, 520? 
Ty v [(—ayy + )/By] ô 
[(83Tv y5 — oo Ty v5) / By] YS + (ay/By) Ty, bab 


oR [ue 
2mm/n 4$, J' 


Since the x and y planes are decoupled, we can separate them. The x plane 


is 


E ‘| p (2mm /n + dz) 2 


sin (2mm/n-- z) | cos (2mm /n + dz) 


— (af Bs) Tx að (1/85) Tr aô 
[(62 Ta, zô — Qr 2T as) / B] 6 (a / B.) Ts,a50 


= cos (2mm /n + dx) 2 G 


sin (2mm/n + x) cos(2mr/n + $4) dj 


iM 


V3 f cos(2mm/n-- à;) — sin (2mn/n + ox) 0 
sin (2mm/n-- $4) | cos(2mm/n + dx) V/ Bs T, 520? 


m=0 


and the y plane is 
va (( cos (2mm/n -- dy) — sin (2mm/n + dy) 
sin(2mm/n + oy) | cos (2mm /m + dy) 


^ — (ay /By) Ty b50 (1/8,) 25 
[( 


Mi 


m=0 


elias E aZTy vs) / By] ô (Gy BUT, y 550 
cos (2mm/n +y) sin (2mm/n + ¢y) Yi 
l — sin (2mr/n + dy) cos (2mm/n + py) bij 
The terms (z|ó?) and (a|ó?) are 


n—i n—1 


— . (9mm giQmm /nt-ós) _ e—i(2mn/n+ox) 
S (24a) eer 


cin?" n —in2m/n 
Epi eis / M p: / E 
] — ei27/n 1 — e-i2"7/n ? 
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n-1 al) 3 s 
2mm giQmn /nós) 4 g- iQ mn/ntós) 
X cos (= ds de) = J a ee 


m=0 m=0 2 
in2m/m —in2m/mn 
sd lec M cuim Le 
2 E= ei2n/n Tes e- On/n 


Every other term in the above expression can be written in the form of 


-1 praa va eg iQmn /n$;) | l LÀ - e iQmn /nó;) | 2-l 
0 


2 2i 


where | = 0,1,2,2 and j = {x,y}. Dropping the common parts in each sum, 
which are functions of $;, there are only three kinds of sums 


n-1 _ eindn/n n-1 Pa eg in4n/n n-1 


y giemn [n = rs 5 e mnn = re, 5 l=n, 


m=0 m=0 m=0 


where V7. 1 e47/" = 0 and $777. e-*1m/^ — 0 when n # 2. As a result, 
we have 


m=0 m=0 

n—1 

SS sin (= es) cos (= +45) =0 
m=0 


2 2 2 2 
» B ls,ró KE OT rad By To,y6 al ay Ly,b5 


à, C 
y By 


y,b50 = Ô, 


UE 
By 
which shows that there are only two independent terms. 


It turns out that there is another way to prove this point which is probably 
more elegant. We first observe that, from the equations of motion (3.22), the 
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second order map of the n cell achromatic system in the normalized coordi- 
nates can be written as 


i, Hi + Tr rotið + T, asdið + Ty 5267 

ay MEL Ta sadi $ Ta asi + Ty 525 (9.2) 
Yf Yi + Ty, ysid + Ty, b5bið 

by bob Ty usó -— Tp ss bió 


where all geometrical terms vanish. Note that midplane symmetry is obeyed. 
Since the x and y planes are decoupled, let us study the x plane first. Let us 
denote the second order map of the whole system 


ML (n) 2 £T (n), 
that of one cell is 
M (1) 2 R (1) - T (1), 
and that of n — 1 cells is 


M (n — 1) 2 R (n—1) - T (n - 1). 


Since the n cells are identical, the whole system can be viewed as either one 
cell in front of n — 1 cells or vice versa. Thus the following relations hold 


M (n) 23 M (n — 1) e. M (1) 2a [^R (n 2 1) + T (n 2 1] e [R (2) + T (1)] 
—3 1-4 R(n-1)oT (1)-- T (n- 1)o R (1), 


and 


M (n) —2 M (1) o M (n = 1) =2 [ (1) +T (1)] e [R (n — 1) +7 (n — 1)] 
=2I +R (1)oT (n—1)- T (1) o R (n— 1). 


Removing the first order part, we obtain 
T (n) - R(n-1)o T (1) - T n-1)eR (1), 


and 


T (n) 2 R(1)o T (n— 1) - T (1) e R(n— 1). 
Furthermore, we obtain 
T (n)oR(1) ! 2R(n—1)oT (1)o (1) ! & T (n— 1), 


and 
R(1) oT (n)=T(n—-1) + R(1) oT (1)oR(n—1). 


Using the relation R (n — 1) = R (1) ! , we reach the following relation 


T (n) o R (1) 2 R (1)o T (n). 
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Plugging in the first two components of eq. (9.2), we arrive at 


COS Uy SiN [ly T, 5267 T, 5207 
— sin Ja COS Ug T, 520? B T, 520? 


and 


Tossb T, asd COS [lz Sin Hg a COS [lz SiN Hy Ta xô To asô 
Ta oô Ta ab —sin Uy COS Hr — sin [lz COS [i Ta 290 Ta asô 


For the pure chromatic terms, we have 


1 — COS uy —SiN Hg T, s 0 
sinus 1—cospy Ta o2 |2040J. 
Since, for n > 1, we have 


l—cosu,  —sinu; Y _ 8 
det ( sin Hz E en COS Has) 0. 


we conclude that 


T, o2 = 0, Top =0, for n>l. 


For the mixed chromatic terms, we have 


fis COS [Ay — La,a6 sin La Tr xô sin Hx T Lz,a8 COS Hg ) 


Ta,xô COS Uz — Taas sin Hz Tors sin Hx T La,aô COS Ug 


—T x26 sin Hx T La x COS Ur —Lx,a6 sin Hæ T fa,aó COS Hg 


E | o a8 COS Ux T La,xd sin La Tzr,aô COS Uz T La,aó sin Hz ) 


For each component, we have 


—Ti.aó sin Ha = Taxó sin Has Ty ,26 sin Ha = Tani sin Has 


—Ta að Sin ay = —Ty 25 SÌN Ug, Ta,xô SIN Hg = —Ty aô SÌN Hg. 
For n Z 2, sin ps zz 0. We have 
Ta zô = Taas =0, Teas + Tus — 0. 
From symplectic symmetry, we have 


1+ Ta zô T. asô 
det A M eH =1 
Ta,250 1+ Ta,a50 


| 
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up to the first order of 6. As a result, we have 
Tars T Toas = 0, 
which, combined with the previous result, leads to 


Tx,05 = 0, Toas =0. 


In the original space, 


and 
ma) m s ne) Ty „aĝ Â 
Ta,xô Taas Ta,26 Tg 
Teas 1/WBe 0 
oe "m Tras 0 O/N/ Bx VB 


= {e Ps y Tzr aô- 
— Yr —Ay 


The Twiss parameters here are the periodic solution of the cell. In conclusion, 
there is only one independent chromatic aberration in the x plane. The same 
conclusion can be reached for the y plane following the same procedure. This 
proves, from the global point of view, that only two second order chromatic 
aberrations are independent. - 7 

Furthermore, it is easy to show that the remaining terms Ty a and Ty bs are 
simply the chromaticities. We can write the transfer matrix in the normalized 
space as 


db 1 Tar asô B cos (27) sin (27) + To asô 
7 EXE asô 1 ^" \—sin (27) — p cos (27) 


cos (27 + Tras) sin (27 + 1,56) 


— gin (27 + Taas) cos (27 + Taai) 


—1 


and we obtain 


£s = imgz,aó- 


Similarly, 


: cos (27 + T, asd) sin (27 + m) 
M, =1 i - 
— sin (27 + T, asi) cos (27 + 1,096) 


, 
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and £ 
&y = Ty,bs- 


The simplest of such a second order achromat consists of four FODO cells 
with Uz = Hy = 1/2 and two families of sextupoles correcting the chromatic- 
ities. 


9.1.2 The Dispersion Suppressor 


As it become clear shortly, an achromat of n identical cells is not optimal 
in terms of minimizing Dmax. Let us consider half of an achromat, 


~ f- d 
eps). 


For an n cell achromat, D, D' at the center are 


D -1 0 d 0 d 
D'|- 0 —1 d' 0]2[d'], 
1 0 0 1 1 1 


whereas the periodic solution is 


D -1 0 d D 
D' 0 —1 d' D' 
1 0 0 1 1 

d d' 

=> D=- D'2-—. 

2' 2 


Obviously the dispersion at the center of the achromat is twice that of the 
periodic solution of a cell. There is a module called dispersion suppressor 
which makes the whole section an achromat while maintaining the periodic 
solution in the regular cells. It takes advantage of the fact that dipoles, espe- 
cially when 0 is small, affect only the dispersion, not focusing. A dispersion 
suppressor consists of two FODO cells which are the same as the standard 
cells except for the bending angle. Two free parameters, or *knobs," such 
as the the bending angles can fulfill the two conditions needed to obtain an 
achromat. 
Recalling eq. (9.1), the transfer matrix of a FODO cell with bending is 


1-P/2f? — 210 + 1/2f) 210 (1 4- L/Af) 


M, =| -1/2f?+i?/af? i-Py2f? 26(1-1/4f — /8f?) 
0 0 1 
COS u GBsin p 20 [1 + (1/2) sin (u/2)] 8 


=| —(1/8)sinp cosu 2[1 + (1/2) sin (u/2)] [1 — sin (u/2)] 0 
0 0 1 
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For two cells of bending 01 and 62 per half cell, the total matrix is 


COS u Bsinu 20 [1 + (1/2) sin (u/2)] 02 
My | -/É)smp cosy 2[1 + (1/2)sin (u/2)] [1 — sin (u/2)] 62 
0 0 1 
COS u B sing 20 [1 + (1/2) sin (u/2)] 01 
—(1/8)sinu cosu 2[1 + (1/2) sin (n/2)] [1 — sin (u/2)] 01 
0 0 1 


cos (2u) Bsin(2u) d 
--ü/8)smQu) cosQu) d' 
0 0 1 


, 


d —2l | n sin (5) 6; cos pp +2 | + 5 sin (5) [1 -sin (£)] 6,8 sin p 
n» h + 5sin (£)| 0; 


—2l i + L sin (5) [(2 cos u + 1) 6 + 65], 


=2 | + sin (5) [1 — sin (£)] [(2.cos u — 1) 01 + 65]. 


Note that the relation 
g — 2H + sin(u/2) 
7 sin u 
was used during the derivations above which lead to the final forms of d and 
d’. 
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To match a dispersion free region to a FODO cell, we have 


COS (2u) B sin (24) d 0 Dmax 
—sin(2u)/B  cos(2u) d’ 0|- 0 ; 
0 0 1 1 1 
which leads to 
0 


dou eS e 
oco SUN oe C 


(2cos ui — 1) 01 + 05 = 0. 


'The result is 0 
Bim Adres 
|^ Asin (u/2)? ^ i 


So, we have 


e| 3 t5| 3 


Dispersion suppressors are widely used in high energy accelerators where 
the achromatic straight section constitutes only a small portion of the ring. 
Since the main part of the ring is made up of arcs, the cost-effective way to 
build such a ring is to pack dipole magnets as close as possible. A FODO cell is 
the best choice for this purpose. Another way to save cost is to keep the beam 
pipe as small as possible, which saves not only due to smaller pipes themselves, 
but also, more importantly, smaller magnets. Dispersion suppressors help to 
keep beam size small by keeping the dispersion matched. In fact, there is 
another parameter that plays an important role in the optimization process, 
which is the length of the FODO cell. Both Bmax and Dmax are proportional 
to the length of the cell. A shorter cell leads to smaller beam size, but tends to 
decrease the packing factor, which is the ratio of the length of total bending 
over the total length of the cell. 


9.2 Symmetric Achromats 


Achromats based on mirror symmetry are widely used in beamlines and 
accelerators, especially synchrotron light sources. One difference between a 
synchrotron light source and a high energy accelerator is in the number of 
experiments it supports. While a high energy accelerator usually supports 
around ten fixed target experiments and a handful of collider experiments (less 
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than five), a synchrotron light source usually supports tens and sometimes 
more than a hundred experiments with the circumference of the ring only 
a fraction of the high energy counterpart. Furthermore, insertion devices 
(wigglers and undulators) have become the main source of light, as opposed 
to bend magnets. These requirements result in a ring divided into many 
sections (usually identical ones) with long straight sections in between where 
dispersion is either zero or small. Apparently FODO cells plus dispersion 
suppressors are not well suited for this kind of ring. The solution has been 
mirror symmetric achromatic sections with relatively long straight sections at 
the ends. 

Before going into the details of the lattice modules, let us first look into 
the general properties of a mirror symmetric cell. Mirror symmetry here is 
referred to as the symmetry between the cell and its mirror image of the x-y 
plane. In other words, a mirror symmetric cell means that the optical elements 
of the cell are symmetric about the center, both in terms of geometry and the 
excitation of the fields. For example, a quadrupole is mirror symmetric and 
a sector bend is mirror symmetric, too. When a cell is mirror symmetric, the 
map of the cell is the same as that of its mirror image. To obtain the map of 
the mirror image cell, we observe that a particle that enters the mirror image 
cell with (zy, —af, yr, —b5) exits it with (£;i, —a;, yi, —b;), where (zi, ai, Yi, bi) 
and (xf,af,yf,bf) are the entrance and exit coordinates of the transverse 
phase space of the original cell. Hence the map of the mirror image cell is 


M! =Ro MoR}, 


where 
1 0 0 0 0 0 x 
0-1 0 0 0 0 a 
0 0 1 0 0 0 y 
om 0.0 0-1. 0 0 b 
0.0 0 0 1 0 l 
0.0 0 0 0 1 ô 


. 1 
Mi-2|0- 
0 


x 


Mirror symmetry entails that M, = MI , which leads to 


(z|a) = (ala) 
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and 
1+ (riz) 
(z|a) 
Note that the two equations from the dispersion and the dispersion prime are 
not linearly independent. It is easy to verify that the transverse linear matrix 
of a magnetic sector dipole satisfies the above relations. For a linearly stable 
cell, i.e., |(z|x)] < 1, mirror symmetry implies that 
(x|x) — (ala) 


— ————— =0 
xs 2 sin Jt. f 


(alô) = (x|6). 


and 


(1 = (zlæ)) (alô) + (alz)(mló) _ 1- (ax)? + (ala) (ale) (2|6) =0 

2 — (ala) — (ala) [2 — (z|z) — (ala)] (x|a) 
Alternatively, we can also express the transfer matrix of a mirror symmetric 
cell as functions of the first half of the cell. If the matrix of the first half of a 
mirror symmetry cell is 


D'= 


The matrix of the whole cell is 
ÛT = MIN 

(aja); (zļa)ı —(ala)i(zló)1 + (zla)ı(aļő)ı V f (xlv). (zlļa)ı (vló) 
= | (le (xx) —(alx)i(2]d)1 + (xlx)i(aló) || (ale)ı (aļla)ı (al0)i 


0 0 1 0 0 1 
(|i (aja) + (rla)1 (ala) 2(v|a)i (ala) 2(v|a)i (a9) 
= 2(2|2)1 (ala) (|i (ala) + (zļa)ı (aļx)ı 2(a|x)1 (a]0)1 
0 1 


In addition to the relations obtained above, this alternative expression shows 
that when (a|ó)1 = 0, the cell is achromatic. Furthermore it reveals another 
relation, which is 


— 


|x), 


z|a)1 


(aļð) = (x|6). 


—a 
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To make the meaning of the relation clearer, let us add the drift of length L 
after the cell. We obtain 


1 L|((aa (xa: | _ f (ala + L(alz)i (vla)i + L(xlz)s 
0 1j (al) (ale) (a|ai (2|”)1 


and 

1 L\ ( 2(ala)i(alé)1 V — ( 2[Gla)i + L(z|z)1] (al) 

0 1 2(x|z)1(a|9)1 2(x|z)1(a|ó)i 
When L = —(z|a)1/(z|x)1, the second half of the cell forms an image and the 
dispersive ray crosses the axis. In other words, the dispersive ray behaves the 


same as the axial ray from the center of the cell. The reader can check easily 
that it is indeed the case for a sector bend. 


9.2.1 The Double-Bend Achromat 


Now let us study the simplest mirror symmetric achromat, which consists 
of two bend magnets and a quadrupole in the middle. Due to the mirror 
symmetry, the achromatic conditions D — D' — 0 at the end can be satisfied 
requiring D' — 0 at the center. 


De 1 00 1 40 1 L L8/2 0 

o A EF P 1 0 010 01 9 0|, 

1 0 01 0 0 1 00 1 1 

D, 1 Eg (L/2 - L1)0 0 
0 |=| —1/2f 1—(L+ZL)/2f [1 (1/2f)(L/2+ D4)j0 0 
1 0 0 1 1 


L 1/L 
= D.=($+h)0, f=3($+n). 


From the discussion above, the relation f = (L/2 + L,)/2 is simply the result 
of the mirror symmetry. Even with a large bending angle where the exact 
matrix of the bend has to be used, being achromatic always implies that the 
center of the first bend is imaged to the center of the second bend. Since 
f < (L+ L1) /2, it is not possible to build a FODO cell that is stable, so 
a doublet or a triplet has to be used. A simple variant of the double-bend 
achromat (DBA) is the triplet DBA shown in Fig. 9.5, which contains a triplet 
between the bending magnets, with no quadrupoles outside. 

Another type of DBA consists of two bending magnets, a focusing quadrupole 
in between and a doublet outside of each bend as shown in Fig. 9.6. The cell 
is symmetric about the center of QF1. Fig. 9.7 [66, 21] shows an example of 
the lattice function of one example of this type of achromat. 
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FIGURE 9.5: The simplest double-bend achromat (DBA). 
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FIGURE 9.6: The double-bend achromat (DBA). 


9.2.2 The Triple-Bend Achromat 


The fact that the center quadrupole of the DBA images the center of the 
first bend to that of the second makes the DBA lattice somewhat inflexible 
since the horizontal phase advance between the centers of the bends is always 
around m. To overcome this shortcoming, triple-bend achromat (TBA) 
lattices were developed. A TBA consists of three bending magnets, at least 
two quadrupoles between them, and doublets (or triplets) outside, as shown 
in Fig. 9.8. Lattice functions for a typical example of such kind of achromat 
are shown in Fig. 9.9 [65, 21]. 


9.2.3 The Multiple-Bend Achromat 


In the past two decades, the concept of multiple-bend achromat (MBA) has 
been conceived of and developed to further reduce dispersion in the bending 
magnets. As discussed in Section 9.2.4, this will help reduce the emittance 
of the electron beam and increase the brightness of the X-ray produced from 
synchrotron radiation. Fig. 9.10 shows the first MBA lattice, developed at 
the MAX IV Laboratory at Lund University, Lund, Sweden. The middle 
units are very similar to regular FODO cells and the end units are used as 
dispersion suppressors. Recent variants increase the distance between the 
outer most bending magnets and the middle ones to generate a dispersion 
bump there. As a result, the strengths of the sextupoles are reduced and the 
dynamic aperture is enlarged. The complexity of the lattice makes a multi- 
dimensional optimization tool a necessity. In that regard, the spread of various 
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FIGURE 9.7: Lattice functions of a double-bend achromat (DBA), which 
is one of the four super-periods of the storage at Center for Advanced Mi- 
crostructures and Devices (CAMD) at Louisiana State University. 
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FIGURE 9.8: The triple-bend achromat (TBA). 


numerical algorithms in the community greatly expedited the development of 
this concept. 


9.2.4 The H Function 


The design of such triple-bend achromats is mainly driven by the demand 
for small equilibrium emittance for synchrotron light sources. Although syn- 
chrotron radiation is not covered in this book, it is important to introduce the 
concept of the equilibrium emittance which results from synchrotron radiation, 
since it is crucial for understanding the motivation behind the development 
of lattice modules for the synchrotron light sources. The main difference be- 
tween electron and hadron (proton, antiproton and ion) rings is synchrotron 
radiation. Since for a given bending radius the total radiated power is pro- 
portional to 74 (v = E/ mc?) , synchrotron radiation becomes significant at 
a much lower energy for electrons. The energy loss is compensated with RF 
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FIGURE 9.9: Lattice functions of a triple-bend achromat (TBA) of the 
Advanced Light Source (ALS) at Lawrence Berkeley National Laboratory, 
California, USA. 


cavities. 

Since the photons are emitted into a forward pointing cone of the opening 
angle 1/y which is small for electrons of GeV level energy, all three components 
of the momentum decrease at roughly the same rate. The RF cavity, on the 
other hand, increases only the longitudinal momentum. As a result, transverse 
momentum is damped over time. 

Yet the presence of dispersion in a ring causes the emittance to grow due 
to synchrotron radiation. Let us consider an off-momentum electron moving 
along the closed orbit for the momentum in a dispersive region. After a photon 
is emitted, the position and slope of the electron remain unchanged but the 
total energy decreases. Suddenly the orbit the electron moves along is no 
longer the closed orbit for it and the electron starts to oscillate around the 
new closed orbit, resulting in emittance growth. The equilibrium emittance is 
reached when the damping rate equals the growth rate. It turns out that, for a 
ring with an identical bending field, the equilibrium emittance is proportional 
to (H) where 


mag? 


H = yD? + 2a,DD' + 8,D'?, 


and 
1 


(H) mag = 2np TE Hds. 


Note that D and D’ are periodic solutions of the position and slope of dis- 
persion. Furthermore, H is a constant outside of dipole magnets and changes 
inside dipole magnets. To demonstrate this point, let us consider two points 
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FIGURE 9.10: The lattice functions z, By and dispersion multiplied by 
10 in the MAX IV multiple-bend achromat (MBA) lattice at Lund University, 
Lund, Sweden. (© 1996 IEEE. Reprinted, with permission, from D. Einfeld, 
et. al., in Proc. PAC 1995, 1, 177, 1996 [26].) 


in the ring. The linear map between them is 


Miz =| (alz)i2 (ala)i2 (aló)ua |, 


D» (z|x)19 (z|a)12 (a|d)12 Di 
Dz | = | (alz)i2 (aļa)ı2 (a|ó)i2 Di 
1 0 0 1 1 


Hence, we have 
Ho —5D2 + 205D3D] + B.D? 
N 
_ [oDi tie m) ra oa) |. x 


(zla)i2 (a|a)i» 2 b2 


(z|r)i2 (x]a)i2 | / Di (z|0)12 
os Ye [E i [s 
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+ ((x|6)12, (al0)12) E 5 ( 
( 
( 


Y2 Q2 
+ ((x])12; (@|6)12) & 7 | 


It is clear that Hz = Hı if (a|6),. = (alô); = 0, which is the case for any 
two points that are in the same straight section. With the expression above, 
we can obtain the derivative of H with respect to s which can illustrate the 
matter even clearer. When the two points are close to each other, the linear 
map becomes 


. 1 ds O0 
dM — | —kds 1 ds/p 
0 0 1 


Carrying out the derivation one step further, we obtain 


= F 1 —kds y a 0 
s tosno Cs T^) S) (ans) 


ds\ (49 a l ds\/D ds\ (72a 0 
S Be) (nde o) GRO Cane) 
= s 

ds yı Qı 1 kds Dı ds Y2 Q2 0 
"m a) Ca GS O) CE 8) Case) 


ds 
=; 2 (a1 Dı + 61 D1) 2 


In summary, we have 


2 
H= 3 (o1D1 + 6, D1). 


In order to achieve small emittance, has to be small, which leads to strong 
quadrupoles. This in turn leads to strong sextupoles to correct chromaticities 
which in general would result in strong nonlinear motion and small dynamic 
aperture. TBA lattices can provide smaller dispersion and hence smaller 
emittance than DBA lattices, which result in stronger sextupoles and smaller 
dynamic aperture. This is one of the reasons that TBA lattices fell out of 
favor in the most recent synchrotron light sources. Another reason is that, 
when the achromatic condition is not strictly enforced, DBA lattices appear 
to be more flexible than TBA lattices, especially when more quadrupoles are 
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used. For example, the DBA lattice of the Shanghai Synchrotron Radiation 
Facility (SSRF), Shanghai, China, contains two quadrupole doublets between 
the bending magnets and two triplets outside. In fact, almost all synchrotron 
light sources built in the past decade adopted DBA lattices. 


9.3 Special Purpose Modules 
9.3.1 The Low Beta Insertion 


In both circular and linear colliders, the beam is focused as tightly as pos- 
sible at the collision points to maximize the density of particles and hence 
collision rate. The simplest low beta insertion consists of two quadrupole 
doublets placed and excited symmetrically about the interaction point. The 
upstream doublet is roughly a parallel-to-point system, in which an initially 
nearly parallel beam is brought down to a small point and the downstream 
doublet is a point—to-parallel system. In hadron colliders where the emittance 
is relatively big, triplets are used to increase focusing power and reduce the 
width of the beam in the quadrupoles. In addition, it provides more flexibility 
for tuning. 


Recall from eq. (6.13) that 


2 


B(s) = B* Fo 


Since the distance between the interaction point and the last quadrupole is 
on the order of 10 m and £* is below 1 m, the 6 functions in the quadrupoles 
range from hundreds m to over 1 km (see Fig. 9.11). Combined with high 
gradient in the quadrupoles, the low beta insertion generates large chromatic 
and geometric aberrations. In hadron colliders, due to the relatively large 
emittance, the main effect of the aberrations is the additional chromaticities, 
which are corrected by the sextupoles in the arcs. In addition, the large beam 
size at the quadrupoles implies extra tight tolerances on multipole errors in 
those quadrupoles, which, if too large, would excite undesirable resonances 
causing emittance and/or beam loss. In electron-positron linear colliders, 
the emittance is small and the aberrations generated by the low beta inser- 
tion would cause sizable increase in beam size at the interaction point. As a 
result, the matching section between the low beta section and the linear accel- 
erator (linac) is rather complicated. Telescopes are used to minimize certain 
aberrations and sextupoles are used to correct chromatic aberrations. 
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FIGURE 9.11: Lattice functions of a typical low beta insertion with sym- 
metric quadrupole triplets. Here 6* is 0.5 m. 


9.3.2 The Chicane Bunch Compressor 


A simple yet very effective and commonly used module in linac based free 
electron lasers (FELs) is the so-called chicane bunch compressor. It consists 
of four identical rectangular homogeneous bending magnets separated by drift 
spaces, with the middle two magnets bending in the opposite direction, and 
the reference orbit perpendicular to the entrance of the first and third magnets 
and the exit of the second and fourth magnets (see Fig. 9.12). The whole 
module is mirror symmetric about the center. Such an arrangement ensures 
that the bunch compressor is achromatic to all orders and that electrons with 
higher energy go through shorter paths. When a bunch of electrons enters the 
compressor with a correlation between the longitudinal position and energy, 
the bunch length changes at the exit of the compressor. If the slope is negative, 
i.e., the electrons in the head of the bunch have lower energy, the bunch is 
compressed. 

Next, let us take a look at the basic optical properties of the chicane bunch 
compressor. The horizontal transfer matrix of the first bend is 


' 1 0 0 cos $ Rosing Ro (1 —cos¢) 
Mi =| 1/Rotand 1 0 —]/Rosinó cos¢ sin ó 
0 0 1 0 0 1 


cosó Rosing Ro (1 — cosQ) 
= 0 1/cos¢ tan $ 
0 0 1 


To obtain the transfer matrix of a bend magnet with the opposite direction 
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FIGURE 9.12: Layout of the chicane bunch compressor. The top/bottom 
trajectories are those of particles of lower/higher momenta than the reference 
trajectory in the middle. 


of bending, we have to first find out the transformation between internal and 
external coordinate systems. Taking into account the fact that positive z in 
the internal system (away from the center of the arc of the design orbit) is 
negative in the external system, the transformation is 


—1 0 0 
Sy, = 0-1 0 
0 01 


The transformation in the vertical plane is the identity matrix. Therefore the 
horizontal transfer matrix of the second bend is 


cos o Rosing Ro (1 — cos) 1 0 0 
M2-$,| —1/Rosinó coso sind 1/Rotand 1 0 | $z! 

0 0 1 0 0 1 
-1 0 0 1/cos¢@ Rosing Ro (1 — cos ¢) -1 0 0 
= 0-1 0 0 cos $ sind 0-1 0 
0 0 1 0 0 1 0 0 1 

1/cosó Rosing —Ro (1 — cos) 

= 0 cos $ — sin $ 
0 0 1 


The horizontal matrix of the first and the second bends separated by a drift 
Li is 


1/cos ó Rosin dé —Ro(1—cos¢)\/1 Li OV cos ó Rosin d Ro(1—cos ¢) 


Mh^-| 0  cosó —sing 010] 0 1/cosọ  tanó 
0 0 1 001 0 0 1 
1 2Hosinó + Li/cos4$ [2Ro (1 — cosQ) + Lı tang] / cosó 
=| 0 1 0 


0 0 1 
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From the mirror symmetry of the chicane, we can conclude that the module 
is achromatic. For ¢ < 1, the dispersion between the second and the third 
bends is D = L1¢+ Rog’. It is worth noting that the focusing in the vertical 
plane is insignificant. As a result the bunch compressor is transparent in 
transverse dynamics. 


Finally, let us work out the path length difference between the reference 
electron and one that has a different momentum p = (1+) pg. Due to the 
symmetry, the difference can be obtained analytically, which is 


l— lo = 4 (Rẹ — Bain) + 2L1 [=> = 1) 


where 


R=R(1+9), 


and 


R 5 
sinó — * sin o = TO 


Plugging in R and $, we have 


(1+ 6) cos po 
(1 + 6)? — sin? ġo 


sin Qo 
1+ô 


l—lo = 4Ro |a + ô) arcsin ( ) m 60 +224 


In order to have a better idea about the relation between | — ly and 6, we 
would like to learn how the low order terms behave. Before we proceed with 
the Taylor expansion of | — lo, let us first work out that of arcsin(zo + Ax). 


Using arcsin(z4/1 — y?+yv1 — z?) = arcsin(x)--arcsin(y) and setting x = xo, 


we obtain 
Xo 1—y?+y\/1— 22 = to + Az. 


After straightforward algebra, we obtain 


y = (zo + Az) 4/1 — 22 — zo 1— (zo + Az)". 
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As a result, 


arcsin (zo + Az) 


— arcsin (20) +aresin| ir + Az),/1— 22 — 21/1 — (xod Aa 


2xoA Ax? 
= arcsin (x9) -- arcsin le + Ax)A/l-zà-—scojl a 
— 2 


ee Ar? Ax? 
= arcsin (zo) --arcsina Ary/1—224+204/1 So b= a 
(1— a5) 8(1—22) 
2zoAz + Ax? 2 Ag? 
=p arcsin (zo) +Ary1- +21 — DARE M ds 
i= ae) 
Ax xo Ax? 
= + ———Ààá. 
Vl-—2% 2(1-— z2)? 


To the second order, the path length difference is 


=p arcsin (zo) + 


l — lo =2 4Ro |(1 + ô) arcsin ( (1 — ô + 6°) sin do) — ¢o] 
(1+6)cos¢o | 1 
V/ cos? po + 26 + 62 
=> Alto [(1 + ô) arcsin (sin ġo — (6 — 6”) sin go) — 0] 
(inso) D 
1 + (26 + 62) / cos? ġo 


=> 4Ro { (1-45) | do — (6 — 62) tango + 56 tan? go] — oo} 


126-0? 3 46? 
2 cos? do  8cos* do 


+21, 


+21, 


+ 2D, |a + 6) (: — 


= 4Ro |(^ — tan ġo) ô + ; (tan? po) a 
3 8 
m 214 tan? po (; — m . 


For ¢9 « 1, only the terms of the lowest order in $9 are important and we 


obtain " 
| — lo =o —2L1¢2 (3 — z0) ' 


As an example, we discuss the parameters of the first bunch compressor 
at the Linac Coherent Light Source (LCLS) at SLAC National Accelerator 
Laboratory, California, USA, which operates at a beam energy of 250 MeV. 
It is worth noting that the bend radius Ro of 2.48 m (corresponding to a 
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FIGURE 9.13: Mechanism of an RF buncher cavity. 


magnetic field Bo of 3.36 kG) and Lı at 2.61 m are roughly the same and 
that the bend angle of ¢9 = 4.62°, which corresponds to a magnet length 
Ly = 0.2 m is very small. For the term (1|ó), the contribution from the 
magnets is —1.74 mm and that from the drifts is —34.07 mm. For the term 
(1|02), the contribution from the magnets is 2.62 mm and that from the drifts 
is 51.44 mm. At this energy, the electrons are relativistic enough that the 
contribution from the difference in velocity is miniscule (—27 um for (l|) and 
40 um for (1|5?)). 


9.3.3 Other Bunch Compressors 


As shown above, chicane bunch compressors work when the higher momen- 
tum particles are in the tail of a bunch. Yet higher momentum particles are 
often in the head of the bunch. An important example is the DC gun, where 
higher momentum particles are faster and thus arrive earlier. To compress 
such bunches, one method is to configure electrostatic or magnetic fields in 
such a way that the faster particles go through longer paths. 

Another method is to reverse the correlation between energy and longitu- 
dinal position before the bunch enters a chicane bunch compressor. This is 
achieved most commonly through an RF cavity, which because of its func- 
tionality is often called a buncher. It is a regular RF structure which is set 
up through adjusting the phase such that the mean energy of the bunch is 
unchanged. Meanwhile, the head of the bunch, where the high energy par- 
ticles are, is decelerated, and the opposite happens to the tail of the bunch. 
Fig. 9.13 illustrates the mechanism of a buncher. When the particles are not 
highly relativistic, an RF buncher and the drift space downstream can achieve 
bunch compression. This is called ballistic bunching. 


Chapter 10 


Synchrotron Motion 


Up to now we have been primarily concerned with the motion in the trans- 
verse planes. Yet, for particle accelerators, as the name implies, acceleration 
is the primary interest. T'herefore the motion in the longitudinal phase space 
has to be understood. Although there are many different ways of accelerating 
charged particles, we restrict ourselves mostly to circular accelerators (syn- 
chrotrons, to be specific), where the acceleration is done using radio frequency 
(RF) cavities. The only exception is the last section, where the transverse dy- 
namics of RF cavities is discussed, which is of great significance mainly for 
linacs. 


The chapter is organized as follows. First, a section is devoted to a brief 
description of a typical RF cavity used in a ring. Second, the time-of-flight 
as the function of energy is derived. Next, combining the results from the 
previous sections, the map of the longitudinal phase space is obtained and 
the longitudinal motion is studied in detail. Last, the transverse effect of the 
RF cavities is discussed briefly. 


10.1 RF Fundamentals 


Most RF cavities used in synchrotrons are variations of the cylindrical pill- 
box cavity, which consists of two circular metallic plates of radius Re that are 
separated by the distance | and that are connected with a cylindrical mantel 
of radius Re, resulting in a geometry reminiscent of a circular pill box. 


The field distribution in the interior of such a kind of a metal box can be 
written in a simple analytical form. Following Wangler [70], the electromag- 
netic field of such a cavity of length l and radius Re with transverse magnetic 


DOI:10.1201/512074-10 241 


242 An Introduction to Beam Physics 


FIGURE 10.1: Typical RF cavity field with fundamental mode TMoio. 
Radial dependence of the normalized electric field E;(r,0)/ Eo (left) and the 
normalized magnetic field Be(r, —1/4f)c/ Eo (right) are shown as a function 
of xoir/Re. 


field (mode of TMmnp) can be written as 


E, = EgJq (kmnr) cos (m0) cos xis cos (wt), 


E, = E — Eo Jln, (kmnr) cos (m0) sin PTZ cos (wt), 
Ep = T c EoJin (kmnr) sin (m0) sin cos (wt) , 

B. =0, “ 

B, = wee Pom (Kar) sin (m0) cos a sin (wt), 
Bo =w P EoJ,, (kmnr) cos (m0) cos m sin (wt) , 


where kmn = Umn/Re and w = cy k2,, + (pr/0)?. Note that the quantity £mn 
is the nth zero of the Bessel function Jm(x) (excluding the origin, n > 0). 

Usually the fundamental mode of TMo;o is used for accelerating charged 
particles, whose field is 


E, = EoJo (=) cos(wt), E,=0, Eg-0, 


E, “oir\ . 
B,=0, B,=0, By = a (2 ) sin), 


where the relation Jj (x) = —Jı (x) is used to obtain Bọ (see Fig. 10.1.) 
Note that Bg is proportional to E^ with 90° phase lag, which is the result of 
Faraday’s law and that zo1 = 2.405 which, together with the design frequency, 
determines the size of the cavity. Specifically, for the mode of TMo10, we have 
To1ie — Loe 

w 2nf. 


Re= 
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For f = 500 MHz, R. = 0.2295 m. In realistic cavity designs, the actual 
shape of the cavity is often more spherical than cylindrical. Yet the overall 
dimension is not very far from this crude estimate. 

For r «& Re, which is usually where the beam is, the field can be approxi- 
mated by the lowest order term of the Taylor expansion, which is 


Eo XLoir 
Ez = E t), Bo=-— 
1 o cos (wt) 0 1 E 2R. 


sin (wt). (10.1) 


Note that Jo (x) = 1+0 (x?) and Jı (£) = x/2+0 (x?) for x < 1. As a result, 
the effect of Bg is much weaker than that of E; on the beam. Furthermore, 
the focusing effect of the magnetic field is usually negligible compared to main 
focusing elements, the quadrupole magnets in the ring. 

To illustrate this, let us look at an example. Let us consider again the case 
of a 500 MHz cavity. Assuming that Ey = 20 MV/m, which is not far from 
the breakdown limit of copper at this frequency, we obtain the peak gradient 
of the magnetic field, which is 0.35 T/m. Normal conducting quadrupoles, on 
the other hand, can have field gradient up to 20 T/m. Furthermore, there are 
usually tens to hundreds of quadrupoles with lengths between 0.2 and 1 m in 
a ring, whereas there are at most a handful of cavities with lengths usually 
below 0.5 m (around 0.3 m for a 500 MHz pillbox cavity). As a result, the 
integrated gradient of the cavities is on the order of up to perhaps 1/1000 
that of the quadrupoles. 

Recently, the dipole mode (TM110) has also been used to kick the beam 
transversely. The field of TM110 is 


d11T 


B, = EA (3 


) cos cos (wt) , E,—0, Eg-—0, 


c 


E c . : 
B, — 0, B,-2-— d: J (==) sin 0 sin (wt), 


C X11T c 
E Re ; 
by =È |^ (==) - #4, (==) EAE 


where the relation Ji (x) = Jo(x) — Jı(x)/x is used to obtain Be. Note 
that 21, = 3.832. For the same cavity, the frequency of the TM;19 mode 
is 411/201 £ 1.6 times that of the TMo1o mode. In order to get a clearer 
physical picture of the effect of the field on the beam, let us again perform 
Taylor expansion around the origin and keep only the leading term. The field 
is 


d11T 
E, =, E 
1 °OR 


cos 0 cos (wt), 
c 


E E 
B, =, CS sinsin (wt), Bọ — = cos sin (wt). 
2c 2c 
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Opposite to the TMo10 mode, the magnetic field has a much stronger effect 
on the beam. Furthermore, we have 


E, 
B, =1 B, sinf + Bo cos0 =; = sin (wt), 
c 


which is an alternating current (AC) dipole and is best suited for kicking 
the beam transversely. Again, using the example of a 500 MHz cavity and 
assuming that Eg = 20 MV/m, we obtain the peak field, which is 0.033 T. 
For such a cavity that is 0.3 m long and the energy of the electron beam being 
1.9 GeV, the peak kick angle is 


ev,;ByoAt — eEgl _ 20 x 10° x 0.3 


= mg a SG TOFS 
Ds 2p.c — 2x19 x 109 * 


Oy = 


The approximation that equates p,c to the total energy of the electron is based 
on the fact that the relativistic fact y is around 3800 and that the divergence 
of the beam is usually a fraction of 1 mrad. 

Now let us come back to the TMoi9 mode and find out the energy gain 
(AK) per pass. To simplify the matter, let us consider a particle that moves 
along the optical axis and the energy gain per pass is much smaller than its 
total kinetic energy (K), which entails that the change of velocity in the cavity 
is negligible. As a result, the energy gain is 


AK =q] E; (0,z,t(2)) dz, 


L 
2 


where t(z) = to + z/vo. Here we set the origin of the z-axis at the center of 
the cavity. For TMo19 mode, we have 


AK =q " Ep cos [wt (z)] dz — q Bacos fu (w+ 2) |a 


-4 -4 vo 
j i 5 
= iE | cos (^ 4 =) dz = Es | cos (o + zx) dz 
= vo -4 Bor 
sin (a1/ Bo.) 
= qEol = 
qEol cos (ġo) AUBA ^ 


where is the wavelength of the electromagnetic field and £o = vo/c. The sin 
function in the equation above is the result of the finite length of the cavity, 
which is called the transit time factor (T). For the TMoio mode of a pillbox 
cavity, the transit time factor is 


_ sin (nl/ 89A) 
B 7l/ Bor : 


and the energy per pass is 


AK = qEolT cos (¢o) . (10.2) 
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It is clear that the transit time factor is the ratio of the energy gain of a RF 
cavity to that of a DC (direct current) gap of the same field. The relation 
between the transit time factor and the length of the cavity is shown in Fig. 
10.2. It is obvious that T — 1 as l — 0 which means that, for constant voltage 
between the gap, the shorter the gap, the closer the energy gain to that of 
the DC gap. Yet the electric field breakdown limit of the material determines 
the maximum field that can be achieved, thus the energy gain is proportional 
to IT, which is in turn proportional to sin(z/ 9A). Therefore, the maximum 
energy gain for the case of constant field is obtained when | = 89A/2. For 
electron storage rings such as those of the synchrotron light sources, £o ~ 1. 
So we have | = A/2, which corresponds to the fact that the time an electron 
takes to pass through the cavity equals half of the period of the oscillation. 
The transit time factor is T = 2/7 = 0.637. For a 500 MHz cavity, we have 
| — 0.3 m. 

In addition, other issues such as RF power efficiency also have to be taken 
into account. The most used parameter measuring the efficiency is called the 
shunt impedance, which is defined as 


where 
AV = EolT cos (¢0), 


which is the voltage across the accelerating gap and P4 is the power dissipated 
in the wall. As a result, realistic normal conducting cavities are more or less 
spherical in shape, minimizing the total surface area, with nose cones around 
the beam axis to reduce the length (acceleration gap) of the cavity, maximizing 
the voltage across the gap. For superconducting cavities, the dissipated power 
is much smaller and hence the main goal of design optimization shifts to 
minimizing the peak field on the surface for a given on-axis field to reduce the 
risk of costly quench. The resulting shape is basically the bell shaped cavity, 
which is preferable for other practical reasons as well. 


10.2 The Phase Slip Factor 


Toward the end of the previous section, we studied the energy gain per pass 
of one particle. In this section, we will study the energy gain of a particle 
over many passes in a circular accelerator. Since the cavity is always designed 
for a given accelerator, there is at least one particle (the reference particle) 
that comes back to the cavity at the same phase (synchronous phase ¢,) 
every turn. For an arbitrary particle, it may not come back to the cavity 
at the same phase since it may have different energy. Although an arbitrary 
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-- 
1/Bor 


FIGURE 10.2: Dependence of transit time factor on length of the cavity. 


particle may not move along the reference orbit, which is a closed curve, 
the effect of the change of arrival time due to the transverse motion tends 
to average out due to betatron oscillation. This will become clearer below 
when the large difference between the frequencies of the transverse oscillations 
and that of the longitudinal one is shown. As a result, we only consider a 
particle that moves along the close orbit of its energy. Let us consider a ring 
with midplane symmetry that uses pure magnetic elements for bending and 
transverse focusing. Hence eq. (3.17) can be simplified as 


— RO E i] = 
1+ mo ps (6 


Hl m2 mu) 
"o(24-mo) - 


no K 
—|(1+h + 6 jug NU AM oae ZEE 
[ Ji EU. )/ ET 2+ No K vo 


where zs and as are the horizontal position and momentum of the closed 
orbit of an off-momentum particle. The difference in arrival time between the 
reference particle is 
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The phase slip factor is defined as 


At 
ph —_ =" 
ECC. 
F 1 
no + 10 no 
= 1—(1+h 1 ô 1+2 ô + ——— 8 — a2 |d 
EE | (1+ sr -R- J/ + 2 m mem ] E 
0 
= po ghe uu... (10.3) 


Note that the variable ô is defined as AK/Ko. In a pure magnetic system, mo- 
mentum is the more natural variable since it scales linearly with the magnetic 
field. To this end, we recall eq. (3.16) and have 


Using the relations 


A 
E Sh St i 1] 7 9o (14-0), 
Do Do 


we obtain 


aes = (+8) (14+ zs). 


After a little bit of algebraic manipulations, the exact functional relation 
between 6 and Ap/po is obtained, which is 


1+ 70 
no 


ô= -1 +,/1+ 


As a result, the phase slip factor can be written as 


At 
phem C ER 
"Ob 
E (o) 
= E no (2 + No " 2 3 
“aa (1+ has), / 1+ ia? es, capace de 
=n" +p +: (10.4) 


Taylor expanding eq. (10.4) to the leading order and taking into account the 
fact that 


we obtain 
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The first term is phase slip due to the difference in velocity and the second 
term is that from the difference in path length, which is called the first or- 
der momentum compaction factor a1. Since the velocity difference is inverse 
proportional to the square of yo, it becomes smaller as the energy increases. 
At the point that the two terms are equal, the phase slip factor changes sign, 
which is called the transition, which is 7, = 1/ ay. The significance of the 
transition is that the synchronous phase changes from a stable fixed point to 
an unstable one or vice versa. Specifically, for energy below transition, we 
have nP” > 0, which entails that particles with higher energy arrive earlier. 
Therefore the synchronous phase is a stable fixed point when $, lies between 
—r/2 and 0 (Fig. 10.3). For energy above transition, we have 7?" < 0, 
which entails that particles with higher energy arrive later. Therefore the 
synchronous phase is a stable fixed point when $, lies between 0 and 7/2 
(Fig. 10.4). In practice, the phase of the RF cavity has to be changed quickly 
from $, to —$, in order to keep the beam confined, which is called the transi- 
tion jump. It is not unusual that during acceleration, most beam loss occurs 
around transition jump. Figs. 10.3 and 10.4 also show that the maximum 
energy width of particles confined in the longitudinal phase space increases 
when the peak voltage of the RF cavity increases and/or |n!" | decreases. 

The second order effect becomes important when the first order term is 
small enough. From eq. (10.3), we can easily obtain the second order phase 
slip factor. The only complication is that the second order term of 6, has to 
be included. To the second order the relation becomes 


Plugging the equations 
z5 = Dó + D255, a5 = D'ó, 
into eq. (10.4) and expanding it to the second order, we have 


P=- a ol Gor 9 - 328a. 


'The integral form of the phase slippage factor shows clearly which quantity 
contributes. For example, only dispersion in the bending magnet contributes 
to n? and the slope of the dispersion everywhere contributes to 7". Knowl- 
edge of this kind helps greatly during the design of a ring. The computation 
of the phase slip factor, on the other hand, can be done through applying the 
periodic solution of the dispersion function (including nonlinear terms) to the 
fifth variable of the one turn map, which is 


At 1 
g- x — ci Mt (25; a5, Ôp) . (10.5) 
p p 
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V, AE, 


FIGURE 10.3: Sketch of phase stability for energy below transition, show- 
ing stable and unstable motion near the fixed points ¢ = ¢, and ¢ = —@s, 
respectively. 


Again, the fifth and the sixth variables here are defined as —vo (t — to) and 
dp. The one turn map M can be easily obtained using Differential Algebraic 
(DA) technique and the periodic solution of the dispersion function up to 
arbitrary orders can be obtained using the procedure of finding the parameter 
dependent fixed point (see Section 8.2.1). As an example, we study in detail 
the first order phase slip factor. The one turn linear matrix of the horizontal 
and the longitudinal phases spaces can be written as 


(dic) (214) 012 
M — Uz) (Ia) 1 (15) 
0 0 0 1 


a^ = 5 x) D + (a) D’ + (9. (10.6) 
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V, AE; 


FIGURE 10.4: Sketch of phase stability for energy above transition, show- 
ing stable and unstable motion near the fixed points ¢ = ¢, and à = —@s, 
respectively. 


Using the (/|x) and the (I|a) equations of (5.10) and eq. (8.3), we find that 
(Ix) = (ala)(a|6) — (a|x)(al6) 
= (a|x) [1 — (2|x)] D — (|a) D] — (a|x) [—(a|x)D + [1 — (ala)] D'] 
= (a|z)D + [1 — (2|x)| D’, (10.7) 
and 


(|a) = (a]a)(z]8) — (ala) (ad) 
= (ala) [[1 — (x|x)] D — (2|a)D"] — (ala) [-(a]z) D + [1 — (a]a)] D'] 
= —[1- (a|a)] D — (|a) D'. (10.8) 


As result, we have 


"ns 5 {(a|x)D? + [(ala) — (z|z)] DD' — (z|a)D? + (15) 
sin 2 / 12 (18) 1 ‘ 
= -^g- (s D? + 20,DD' + B.D") «55^ — -5 [H sint - (1/8). 


Before finishing the section, let us find out the effect of à sextupole on 
the second order phase flip factor. Let us consider a sextupole with strength 
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ks = b3/ Bp. For a thin slice of it, the effect is Aa = —k,dsx?. From eq. (11.1), 


we obtain the change of the periodic solution of the second order dispersion 
function at the location of the sextupole slice, which is 


AD3N — k,dsD? B cos (sv) 
ADS) 2sin(zv) \ sin (nv) — a cos (nv) jJ ' 


where v = 6/27. Plugging it into eq. (10.6) and using eqs. (10.7) and (10.8), 
we obtain 


1 
An” (s) = c Cle)AD2 + (Ila) AD] 
1 kds D? 


= -O2sn(av) (az) D + (1 — (x|x)) D'] 8 cos (rv) 


— [(1 — (a|a)) D + (x|a) D '] (sin (rv) — a cos (nv))) . 


After straightforward algebraic and trigonometric manipulations, we arrive at 
a simple result, which is 


1 
Anb® (s) = cs D'ds, 


and the total change of the second order phase slip factor is 


1 C 
Ant = = n ks (s) Dds. 


Using the same method, we can easily obtain the result where both horizontal 
and vertical dispersion are present but coupling is corrected. Here the kick of 


the sextupole slice is 
Aa x? — y? 
i53 ce ( —2xy J’ 


and the change of the periodic solution of the second order dispersion function 
at the location of the sextupole slice is 


[o _ ksds (D2 — D?) | Bx cos (TV) 


AD!, 2 sin (7v4) sin (774) — Qx cos (114) 
ADy2\ _ ksdsDsDy By cos (Ty) 
ADS sin (71) sin (1v) — ay cos (my) J` 


Plugging it into the 4D version of eq. (10.6), using eqs. (10.7) and (10.8) and 
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the similar relations for (!|y) and (I|b), we obtain 


Ang (5) = 5 [rA Dao + (a)AD!y + (ly) ADy2 + (116) AD 4p] 
= FE LP (lafe) Da + (1 — (ra) DI] Be cos (va) 
- [(1 — (ala) De + (la) Di] (sin (zv) — as cos (zvs))} 
Canter ([090D, + (1 — (vig) D5] By cos (zv) 


= [a — (b|b)) Dy + (y]b) D] (sin (11) — dy cos (wvy))} : 
Taking into account the fact that the vertical part in the curly brackets is the 
same as that of the horizontal part, we immediately arrive at the final result, 
which is 
1 re 
Arb? = T J ks (s) (D3 — 3D, D?) ds. 
0 


10.3 Longitudinal Dynamics 


Based on the previous two sections, we can construct the one turn map 
with the RF cavity present. From eq. (10.2), we can write the general form 
of energy gain 

AK (r,t) = qVo (r) cos [o (t)] , 


where ¢(t) = wt. For the TMo1o mode of a pillbox cavity, we have 


Vo (r) = EoJo (=) LT, 


where L, instead of l, is used to represent the length of the cavity to avoid 
confusion. Converting to the canonical coordinates, we have 


$ (I) = do + 1, 
K 


and 
7 Ko _ AK (7,1) - AK (0,0) 
$6 (5) =F TARO) Kor AK (0,0) 
Ko qEoLT 


m + qEo LT cos (do) ia Ko + qEoLT cos (do) 
. |^ (=) cos (0 + 21) — cos (on) . 


c 


Synchrotron Motion 253 


As shown in Section 10.1, the transverse focusing is negligible. Together with 
the fact that the change of velocity is insignificant, the cavity is simply a drift 
space for the variable x, a, y, b and l. As a result, we can treat the cavity as 
a thin slice with a kick in energy and the map of the cavity is 


Tg = Ti a = Poi 
25 f pof UE 
Poi 
jy bf = — by, lg — li, 
pof 
Ko qEo LT 


———P———— 
f Ko + qEoLT cos (¢0) k Ko + qEo LT cos (¢0) 


TETE 
a (ee cos (0 T Zy) = (n) , 


where 


poi = M (Ko + mc? — m?c*, 


pof = WV (Ko + qEoLT cos (ġo) + mc2)? — m?c. 


Obviously, the relative transverse momentum decreases as the particles are ac- 
celerated and so is the phase space volume, even though that for the variables 
(£, Px, V; Py, — AM, AK) is conserved. 

It is clear that when |¢o| < 7/2, the reference particle is accelerated and the 
relative energy deviation of the particles at the vicinity decreases on average. 
Together with the rest of the ring, we have the one turn map, which is 


MT - MEA o MAING, 


For the most general case, x, a, y and b are functions of 6 and l is a function 
of x, a, y, b and à. As a result, the cavity couples the longitudinal degree of 
freedom to the transverse degrees of freedom. Yet, due to the large difference 
in oscillation frequencies which will become clear soon, the coupling is much 
weaker than that between the horizontal and the vertical planes. This is par- 
ticularly the case when the cavity is located in a dispersion free region, where 
coupling is limited to the nonlinear part of the map. In reality, it is common 
practice to place cavities in dispersion free regions to achieve separation of the 
longitudinal and the transverse motions. In the rest of this section, we always 
assume that there is no dispersion at the location of the cavity and ignore the 
chromatic terms in the nonlinear part of the transverse map (x, a, y and b). 
Furthermore, we ignore the spatial dependence of the accelerating field due 
to the fact that r «& Re and the difference is second order in r. Consequently, 
the longitudinal degree of freedom is decoupled from the transverse degrees 
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of freedom and the longitudinal map is 


ly =1,4+M, (ôi), 
j= Ko a qEo LT 
fT Kot qEoLT cos (ġo) ^ Ko + qEoLT cos (¢o) 


: [cos (0 + 2i) — cos (40)| ; 


Next, we keep only the linear terms in the map and solve for the oscillation 
frequency. The linear map is 


lf =li + (1]6)ó;, 
EN Ko ô; 
~ Ko +qEoLT cos (ġo) | 


qEoLT gus " 
Kap at 0599585099. 


of 


Writing in matrix form, we have 


A [1 U/l 
bp} \ Msa Mss] \ 6)” 


where 
qEoLT TM 
Me — BELT v 
97 Ko + qEoLT cos (do) & sin (o) , 
1 Ww 
Msi = eer EEN [K ESLTC (16) si l 
G Ko + qEoLT cos (ġo) o + 40 m |ô) sin (ġo) 


It is easy to verify that the determinant of the matrix is 


Ko 
Ko + qEo LT COS (ġo) ] 


which entails that the motion is non-symplectic when qEoLT cos($o) 4 0. 
Furthermore, the longitudinal emittance of the beam decreases as the beam is 
accelerated. It is obvious that the same is true for the transverse emittance, 
which is called adiabatic damping. By redefining the relative energy deviation 
as 


we obtain the new matrix 


6) "n a (3) 
Of Mz, M5; T d 
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where 
qEoLT w | 
Msg = Ko z sin (o), 
1 Uu. 
My = x- [Ko + aEoLT 7 (1/5) sin (0)] , 


which is symplectic. The trace is 


ay ee ee [Ko + qEoLT= (05) sin (9). 
Ko K 


qEoLT w, x 
Ko 


=2+ (13) sin (do) . 


K 
For (Iò) > 0, tr M < 2 if 0 < do < m; for (1|ó) < 0, tr M < 2 if =r < ġo < 0. 
In other words, for energy below transition, the synchrotron motion is stable 
if the synchronous phase $9 € (—7,0) and, for energy above transition, the 
synchronous motion is stable if o € (0,7). If we restrict ourselves to the case 
of acceleration, the stable interval of the synchronous phase is (—7/2, 0) below 


transition and (0, 7/2) above transition. This is the quantitative statement 
of the fact mentioned in the last section. Using the relation 


tr M = 2 cos (u) , 


we have 
qEo LT 
2Ko 


It is worth noting that for most accelerators the relation 


(—&) (10) sin (o) 


Ht = arccos |1 — 


qEoLT 
2Ko 


(—5) (15) sin (do) < 1 


holds. Making use of the fact that 


= TET 2 
arccos x = arcsin y 1 — x7, 


we obtain 
arccos (1— x) = arcsin / 1 — (1 — x)? = arcsin (v 2x — 2?) =, V2r. 
As a result, we have 


| [qEoLT 
2 k 


Ph sin 
(-&) (18) sin (4o) = TE sin (io) 


Ht 


Note that h is the so-called harmonic number, which is the ratio of the RF 
frequency to that of the revolution frequency of the ring wo, and np” is the 
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first order phase slippage factor defined in eq. (10.3). The quantity ju is 
called the synchrotron tune which is proportional to the square root of the 
harmonic number, the accelerating voltage and the slippage factor. Usually 
the slippage factor is expressed in terms of Ap/po, as defined in eq. (10.4), 
denoted here as rj. Hence the relation between the two is 


Yo p 


h 
m ie ni. 


BELT Yo 
Taking into account that us is the phase advance per turn, the synchrotron 
tune in terms of revolution per second can be written as 


1 [27h (qEoLT) nj sin ($0) h (qEoLT) nj sin (ġo) 
Ma Al T mnm. VOA = enna 
to Bóyomc 27 Boome 


which is the usual form that appears in most textbooks. For a circular ac- 
celerator with GeV level energy, the synchrotron tune w; is usually between 
0.1% and 1% of the revolution frequency wo. The betatron tunes, on the other 
hand, are usually between a few to a few hundred times of wo. As a result, 
the coupling between the betatron and the synchrotron motions is usually 
weak. As an example, let us take a look at the storage ring of the Advanced 
Light Source (ALS) at Lawrence Berkeley National Laboratory (LBNL, LBL), 
California, USA, which is an electron machine that operates at the energy of 
1.9 GeV. The main purpose of the RF cavity is to restore the energy loss due 
to synchrotron radiation from bending magnets and insertion devices, which 
is on the order of 0.5 MeV per turn per electron. The harmonic number is 
328 and the slippage factor is roughly 1.4 x 1078. As a result, we have 


Y 328x L4 x 10-3 x 0.5 Us 
i MN ILLUS C M IHE RU. eee og 
is On x 3718 x 0.511 ZN 


Using eqs. (8.1), we obtain 


qEoLT w ; 1 
ic ccm LE 
. (Quà)  — (ld) 
Br-- =| —, 
SIN Ut Ht 
u qEo LT Ww " u Ht 
m Kosin p K sin (0) =1 TUN 


Since uu; « 1, we have a; « 1. Consequently, the invariant ellipse is basically 
upright. For a given longitudinal emittance e;, the maximum bunch length is 


vo vo [(l|ó)e 
lmax =-2 =, —2 , 
E V Bree =1 " m 
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FIGURE 10.5: Phase space plots of longitudinal motion with ¢, = 7/2 
(left) and ¢, = 7/3 (right). 


and the maximum energy spread is 
Meet 
= 2) —12 3 
Ômax Veet 51 (1/5) 


Using the relation 
KC 
(8) = —Ktont! = nf; 


we obtain ; 
Th pe ome? a 
lnax = =2V/C a DART T ae ey ee , 
or (= h (qEoLT) sin =) 
and 


Ümax = we 
K 


99e fE 2th (qEoLT) sin (do) i 
C ni Boo mc? 


The main result is that lax ox (7? E if other parameters remain unchanged. 
Hence, one way to reduce bunch length is to decrease the phase slippage factor. 
Fig. 10.5 shows the details of the dynamics of all amplitude for ¢, = 7/2 (left) 
and $, = 7/3 (right). Note that particles that are outside the stable region 
(called the RF bucket) lose synchronicity with the electromagnetic field in the 
RF cavity. For the case ¢, < 7/2, those particles would not be accelerated 
the same way as those inside the RF bucket and usually will be lost. 


10.4 Transverse Dynamics of RF Cavities 


Up to now, we have only treated the main effect of RF cavities, which is 
to accelerate (or, occasionally, decelerate) charged particles. We have shown 
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FIGURE 10.6: Longitudinal (solid) and transverse (dotted) field distribu- 
tion along the longitudinal axis. 


that, in order to accelerate the charged particles effectively, the synchronous 
phase has to be a stable fixed point, limiting the synchronous phase to just 
one quadrant (see Section 10.2). As shown below, this has significant effect 
on the transverse dynamics of linacs. In rings, the transverse effect of RF 
cavities is negligible due to the presence of magnets. 

Let us start with the longitudinal distribution of the electromagnetic field 
of a pillbox cavity with small holes at the center of each end, which will 
be derived from Maxwell’s equations. From eq. (10.1), we can express the 
longitudinal component of the electric field as 


E, (z,t) = Eo cos (wt + e) H (z + L/2) H (D/2 — z), 
where L is the length of the cavity and H is the Heaviside step function. From 
V- E — 0, we obtain 
10(rE,) E OE, | 
r Or Oz 


After integration, we obtain the leading order transverse component of the 
electric field, which is 


DM = —2 Fy cos (wt + p) [5 (z + L/2) — 6 (z — L/2)]. 
2 Oz 2 

Fig. 10.6 shows the longitudinal dependence of E, and Ey at an instance. 
From (V x B), = (1/c?) (OE; /0t), we obtain 


10(rBo) _ 1 ðE; 


r Or c? Ot” 
Similarly, we obtain the leading order magnetic field, which is 
r OE, wr , 


From the Lorentz force law, eq. (1.1), we obtain 


dpr 
m =q (E, — v,DBs). 
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As a result, the change in transverse momentum across the cavity is 


L/24& E 


—L/2—e& Uz 


T2 


--in 2 cos (wti + p) — 
Uz1 


z 


cos (wtz + 2 
Uz2 


-Z Eo Iz cos (wt; + y) — 72 Cos (wta + e 
1 
T L/2 
+ $F | dzr (z) sin [wt (z) + y], (10.9) 
€ —L/2 
where r1, 041, t1, £1 and r2, v;3, te, B2 are values of r, vz, t, B at z = —L/2 


and z — L/2, respectively. When the charged particle is non-relativistic, i.e., 
B « 1, the contribution of the magnetic field is negligible. Hence eq. (10.9) 
becomes 


Ap, = E Iz cos (wt; + p) — 72 cos (wt2 + v)| , 
2c | Ay B» 


where tı = — Jud dz/[B(z)c] and t2 = ae dz/|B(z)c]. Note that the origin 
of t is set at the moment when the particle is located at the center of the 
cavity. If we make one more assumption that |82 — 81| /8(0) « 1, which 
means that the energy gain (loss) through the cavity is much smaller than the 
total energy of the particle, we obtain that tı = —L/(289c) and t2 = L/(28oc) 
(Bo = 8(0)). Consequently, we obtain 


q rı wL T2 wL 
Ap, = Es Iz cos (= — e) Ex cos (= + e) 
q E rı TL T9 TL 
NP [Z cos (= e) grim (= +e) ! 

Let us take a look at drift tube linacs as described in Section 1.3.2. Phase 
stability requires that —7/2 < p < 0 (see Fig. 10.3). Efficient use of 
energy requires that the particles are accelerated throughout the gap, i.e., 
cos (p — 1 L/ (BoÀ)) > 0 and cos (y 4- *L/ (89A)) > 0. As a result, the par- 
ticle is focused at the entrance of the gap and defocused at the exit, as 
shown in Fig. 1.13. Furthermore, we note that cos(y— *L/(B9A)) > 0 
entails that p — *L/(foX) > —7/2, which leads to «L/ (89A) < p+ 1/2 
and y+ zL/(foÀ) < 294+ 7/2. If —/2 < p € 1/4, p+ «L/ (Bor) < 0 
and we obtain that cos(y+7L/(8A)) > cos(y—7L/ (89A)) . If —7/4 < 
yp <0, p+7L/ (89A) can be positive where cos (p + x L/ (8oA)) decreases as 
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p+7L/ (89A) increases. From the fact that 


TL | TL | E TL (= 


Edi pe e ee —9 
Bor Bor Bor e) pat 


v 7 AA 
we reach the same conclusion that cos (v + 7 L/ (894)) > cos (p — *L/ (BoA)). 
Although we have £1 < b2, yet 82/60, is usually much smaller than cos(q + 
T L/(BoÀ))/ cos(y — v L/(89A)). As a result, the net effect is defocusing. The 
remedy in the early dates was metal foils or grids placed on the entrance of 
the drift tubes (exit of the accelerating gap) to remove the defocusing force 
as shown in Fig. 1.13. Nowadays, quadrupole magnets are placed inside the 
drift tubes to provide transverse focusing. 

At higher energy, the particles become relativistic and the contribution of 
the magnetic field has to be taken into account. Yet the matter is simplified 
somewhat by the fact that the difference between 3; and 82 can be neglected. 
Assuming also that r remains a constant in the cavity, we obtain 


qr aL TL ) 
Ap, = ——— Eg |cos| —— — — cos | — + 
Pr 3Boc || [o 2 a on 
L/2 
qur . 2T 
—E d 
* 328 E esn (5 +9) 
L 
= E T e sin (s a) sin py + Por gi sin (=) sin Y 
qrEo . 
= — sın in 
Borac BA a me 


mTqEoT Lr. 
= —— 5 sin y. 
pagea R 


Again, for the stable phase of —7/2 < y < 0, the net effect is defocusing. It 
is worth noting that, for ultra-relativistic particles, this effect goes away. As 
Bo — 1, the defocusing from the electric field is canceled by the focusing from 
the magnetic field. 


Chapter 11 


*Resonances in Repetitive Systems 


Unlike single pass systems, the dynamics of the beam in a repetitive system 
such as a storage ring is not necessarily dominated by the largest aberrations 
in the one turn transfer map. Due to the fact that particles go around many 
times, the impact of those terms that are nearly in phase with the linear 
motion are amplified and the dynamics is, to a large extent, shaped by 
them. The motion generated by one of those terms is called a resonance. As 
a result, we need a different way to evaluate the relative significance of the 
aberrations that is directly suited for rings. 


Since resonances appear in various physical systems, many different meth- 
ods have been developed to describe this phenomenon. Here we have adopted 
a method that is based on the map method that has been developed here and 
does not require advanced techniques such as normal form theory as in [5]. 


As shown in the previous chapters, a large class of single pass systems 
are imaging. Yet the entire ring cannot be imaging, since it will be linearly 
unstable if |M| Z 1, where M is the magnification. Moreover, the ring is 
unstable with the presence of arbitrarily small errors when |M| = 1. 


11.1 Integer Resonance 


In this section we will study the dynamics in a ring when one or more dipole 
magnets have errors in the field. Let us first consider the case that one magnet 
has a dipole error in the field, which is AB. Without lost of generality, we 
adopt the thin lens approximation, since a thick dipole can always be cut into 
a number of thin slices and the contribution of the whole is the sum of each 
slice. The kick resulting from the error is ABl/Bp. Thus the position and 
angle after one turn is 
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where M is the one turn map of the ideal ring and the kick happens at the 
end of one turn. Similarly, the position and angle after two turns is 


esas) PS 


Now let us assume 


Tn—1 (7 y oe: ^rn—2 0 ^rn-1i[ XO 
(231) = (tes +M ) (anys, ) +% Gy 


and we obtain 


1 1 — cosnu — asin npu — sinnu 
— 2(1— cos u) y sinn 1 — cosny +asinny 


1 — cos u + a sin u DB sin 

l —ysin 1 — cos u — a sin u 
E sin(ni/2) [ sin(nu/2) — acos(np/2) — f cos(nu/2) 

sin(u/2) ycos(nu/2) sin(nu/2) + a cos(nu/2) 

l sin(u/2) + o.cos(u/2) B cos(u/2) 

=y cos(u/2) sin(u/2) — a cos(u/2) 

. sin(nu/2) 5 (n—1)/2 
“Tsing 


Note that 


gp — cos u (n —1)/2 + asin u(n—1)/2 B sin u(n—1)/2 
—y sin u(n—1)/2 cos u(n —1)/2 — a sin u(n—1)/2 
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When u — 27k, sin(nu/2)/ sin(u/2) — n. The result is that the motion is 
divergent in phase space and eventually the beam will be lost. In other 
words, the particle is called in resonance when u = 27k for some k. Since 
errors in dipole magnets are unavoidable, the only way to avoid this resonance 
is to adjust the tune away from any integer. 

In theory, the phase space coordinates become arbitrarily large only when 
the tune is infinitely close to an integer. But in practice, the finite size of 
the beam pipe defines a finite interval around an integer such that the beam 
will be lost, which is called the stop band. In order to determine the stop 
band of a given machine, the size of the beam pipe and the dipole errors of 
the magnets have to be known. The source of dipole errors can be either 
the field errors in the dipole magnets and dipole component generated from 
misalignment of multipoles (quadrupoles, mainly, and sextupoles to a lesser 
extent). Since the dipole errors act on every particle the same way, it is 
sufficient to study the motion of the so-called centroid of the beam only. In a 
ring, this centroid, which is called the closed orbit, is the periodic solution of 
the one turn transfer map. With the presence of dipole errors, the one turn 
map is not origin preserving. Because the distorted closed orbit is usually 
close to the design orbit, only the linear part of the map is included in the 
treatment. The periodic solution of a single error at so is obtained through 


the equation 
(2) = (apy Go) 2o) * (22). 


As a result, we have 


ee. (7- NC 


_ (ABI) (s) 1 B cosu/2) 
Bp X 2sin(u/2) V sin(u/2) — acos(u/2) J ` 


From eq. (6.10), we obtain the closed orbit at an arbitrary location s 


Leo(S) =V Bs/Bso (cos ġ + asin $) zo + y Bs Bso sind ao 


ABI " " 
= ec [VB Bo (cos 9 + aso sin $) Bso cos $ 


BsBso sind (sim 5 — Q0 COS 2) 
(ABl) cos (nv — ¢) 
Bp 2sin(u/2) ' 


where ¢, Bs, Bso, Aso and (ABl). represent ó(s) — (so), G(s), B(s0), a(so) 
and (ABI) (so), respectively. When n dipole errors are present, the closed 
orbit is 


bool) = zo © SANE) cos[u/2 - (618) = (6m) 


m=1 


(11.1) 
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11.2 Half—Integer Resonance 


In this section we study the effect of quadrupole errors. Again, let us 
first assume that only one quadrupole has an error in the field gradient, which 
is denoted as Ak. Without loss of generality, we assume that this quadrupole 
is located at the end of a turn and that it is thin. As a result, the linear one 
turn map is 


HA = 1 0 ^ XO 

(2) = es 3 m) 
u 1 0 cos u + asin u Bsin pp Zo 
|»A—Akl 1 —ysinu | cospu — asinu "p 


In order to simplify the calculation, we adopt a new coordinate system such 
that the unperturbed motion is a rotation. As a reminder of the general 
theory of transformation, let us assume that a matrix A transforms Z into y, 
ie, Y = AZ. Assuming another matrix M is the linear one turn map in the 
space of z, which means that 


x, = MZo. 
Multiplying A to the left and inserting A-!A between M and To, we obtain 
AZ, = ÂM Â! . Azo, 


and hence M 
= AMA Hj 
It is straightforward to verify that 


3 (VP 0 
o/v/B VB 


1//B 0 cosu c-oasinu Bsing JVB 0 
a/ VB VB —ysinu | cosu — osinuJA —a//8 1/4/B 


2 ( eis iu) = Ru). 


— sinu cos 


implies 


AMA = ( 


The relation between the coordinate systems is 


oR va) (2) 
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The one turn perturbed map in the new coordinate system is 


oala pea det) 
di —Akl 1 ao —Akl 1 ao 
1 O)\. To 
= E 1) Bw Ga 


where AK above is defined as AK = Aki. After n turns, the coordinates 


are 
En 1 OV anc) E 

In order to illustrate clearly the nature of the dynamics, we treat the problem 

perturbatively. To the first order, the coordinates are 


(z) - (Ru) + Ge SCORR e SCO 
ene n) (oe ;)ito 
= E Mu ssepe eo xus 
VC f sinmucos[(n — m)u] | sin mpsin [(n — m)y] Yo 
zi 2 ee [((n — m)u] cosmpsin [(n — 22] (2) 
= [Rom AK (olya signe) etn n (ane cene) 


E SMC sim (2m — my] TO" os [(2m — n)a] ) s) 
Enma cos[2m — n)a] -Ema sin [(2m — n)a] 


In order to simplify the expression further, we take advantage of the trigono- 
metrical series 


n-1 _ sin[r + y (n — 1) /2] sin (yn/2) 
2 one +my) = | —  — sin(y/2) ^ 


n—l _ €os [z+ y (n — 1) /2] sin (yn/2) 
2 (£x + my) = o smy - 


266 An Introduction to Beam Physics 


Specifically, we have 


Y sin (2m — n)y] 
E. PGs) ne (n — 2) uy = sni- 2) u + 2my] 
sis[- (n— 2) e /2]- sin [fn - 1) "a 
sin u , 
and 
Y cos [(2m — n)u] 
zy epica) (n—-2)u =F cos (n — 2) w+ 2mp] 
esk (= 2) 4+ 2p (m2) /2)-sinf(n sina 
sing sin u 


Hence the final result is 


Zn d 0 0 
~ | =|R(np)-AK f 
ün cosn sinnu 
AR, 1) sinnu — cosnu AK sin((n— 1)u] ( 0 1 Xo 
———(n — = i 
2 cosnu sinnu 2 sin u 10 do 
When u — 27k or 2z(k + 1/2), sin[(n — 1)u]/ sinu — n — 1. Hence the 
resonance is called half-integer resonance. 
When the tune is a certain distance away from the half-integer, the motion 
is still stable, but the invariant ellipse and the tune change. To obtain the 


perturbed invariant ellipse and tune, only the one turn map is needed. From 
the one turn matrix 


We 1 0 cos u + asin u DB sin 
~ A—Akl 1 —ysinu cosu—asiny 


" cos y + a sin y sinu 
~ \—ysin u — (AK/8) (cosu +asin p) —AK sin u + cos u — asin u J^ 
we obtain 


2 cos (u + Au) = 2cos u — AK sin p, 
cos (u + Aw) + (a + Ao) sin (u + Ap) = cos u + asin p, 
(B + AB) sin (u + Ap) = f sin u. 
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As a result the changes are 


( AK , ) -êK sinu AK 
Ap = arccos | cos p — —— sin p | — u =; —— = —, 
2 — sinp 2 
Aa cos LL + asin y— cos (u+ Ap) ED (a+ AK/2) sin w A 
7 sin (u + Ap) sin cos (Ap) + cos usin (Ap) 
AK AK AK 
=, |a+— 1— ——cotu | —a =, — (1 — acot u), 
2 2 2 
sinu sinu 
Af = —————— d = -— 
P sin (u + Aj) £ sin cos (Ay) + cos wsin (Ap) j 
AK AK 
= 8 (1- ÊE cot) -8 = - 2 cott 


where we remind ourselves of AK = Akl. The final relations are 


AK 
ERA T 
AK sin u — a cos u 
Aa =1' =.= n ; 
2 sinu 
AB AK cosy 
B 7 2 siny 


It is worth noting that the invariant ellipse becomes infinitely large when 
v — k/2. Like the closed orbit, the invariant ellipse is the periodic solution. 

Now let us extend the calculation to multiple errors. The one turn matrix 
is 


gi E ee E p oa 
i = tio (_ (aa, 1) ( (aan, 1 ia ores 1) Mo, 
where, from eq. (6.9), 

"M JB; 0 PE 1//B; 0 

VEA "n wm) mend We a) | 


The factorization gives us a way to simplify the calculation by working in 
the normalized space where the matrices between the kicks are rotations. 
Specifically, we have 


M= v Bo 0 M. 1/VBo 0 
—ao/v Bo 1/VBo ao/vV Bo vBoj ' 


and, denoting AKm = (Akl); fs, 


[s = È ($no) a 4 “+++ + Rd) ee J R (¢o1). 
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To the first order of the errors, the one turn matrix becomes 


COS mo COS Qom COS mo Sin dom 


siny — COs pL qs sin {lm COS [lm 
K -Y` AK. , 
is iens d 2 2: S 2 


m=1 


| sindmo COS om SİN mo SİN dom ) 


Il 
m 


m 


where ji denotes 
Lim = bh 260m. 


The change in tune can be obtained through the relation 


Ec d i 
cos (u + Ap) = 2 tr (a1 ) = cos — E 5 AKm | sin p, (11.2) 
m=1 


which is, to the first order of Ak, 


If the change in focusing is due to the difference in momentum, this equa- 
tion gives the chromaticity. Since the tune is changed due to the presence 
of gradient errors, one important question is how far away it has to be from 
an integer or a half-integer in order to maintain stability for a given set 
of errors. This interval in the tune space that the motion is unstable is 
called the stop band. Assuming that the errors are small, the stop band 
Au is small, too. As a result, the unperturbed tune can be written as 
u = 2n(pt+e) or p = 2nx(p+1/2+e), where e is small. Therefore, we 
have sinu =; +276. Plugging it into eq. (11.2), we find that the term 
— [(1/2)- 0" _, AK] sin i =2 Fre $r; AKm, which is a second order one. 
Hence we have to go one step further to include the contribution up to the 
second order of Ak. The one turn matrix in the normalized space is 


|f npe sinu — cosu 1X sini cost 
M -RQ) - 3 3: AK (e sinu 7-32, AK» cos fi — sin ji 


m=1 m=1 


4s ` AK AK mR (¢mo) G o ) Ê Omm) [s o ) Rn) 


lm-—1l,l«m 
-R( jx AN sinu — cosp by AR sind cos 
zl 2 "cop sinp 2 ™ \ cos fi — sin fi 


COS mo Sin Qj, COS G19 COS Omo Sin dim Sin Gio 


ü bee sin dimCos dio sin mo sin dim 2 


Resonances in Repetitive Systems 269 
The trace of the matrix is 
cM 2 it 
tr U ) —2cosu — | M5 AK, |sinu + 35 AK(AK,, sin dim sin(u — dim). 
m=1 l,m=1,l<m 
To the second order, the trace is 
— 
«(at ) =42]1—277e oss ANS 
m=1 
+ XO AK AK sin dim sin (27v — Pim). 
l,m=1,l<m 
The last term can be simplified, which is 
S AKIAK,, sin Qim sin (21v — dm) 
l,m=1,l<m 
1 n 
= 3 5 Aki AK» [cos (2655) — 1] 
lm-—1 
1 TL 
= 5 AK AK yy, cos (27) cos (2Y m) 
lm-—1 
1 n . . 1 n 
T 2 ARAS sin (207) sin (29/5) - 7 = AK | AKm 
XO AK in cos (2V in) M AK yn sin(2Vm)| -—| 5° AKA], 
m=1 m=1 m=1 


where Ym is the phase at the mth quadrupole. For the integer stop band, we 


have 


tr (ur) =? 


1 — 277? — ne 5 AKm 


m=1 m=1 
AK, sin (2W,, AK. 
[orem] HIE 
m=1 


The unstable region of the tune is determined by the relation 


n 2 
Qn? + Te 5 AKm-—- = 
m=} 


5 AK, cos (2Y m) 
m=1 


n 2 
= | XC AK, sin (2U,,) 
m=] 


m=1 


5 AK, cos (2Y m) 


A 2 
5 AK, <0. 


2 
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Denoting 


2 


2 n 
Y. AK, sin (2Wm) 


m=1 


+ 


, 


Se AK, cos (2Y m) 


mci 


we see the unstable interval of the tune is 
Au 


S 


27€ + I Y AK, 


m=1 


and Au is called the integer stop band. Similar calculation shows that the 
same expression also gives the half-integer stop band [16]. 

In order to obtain the change in the invariant ellipse, we have to go back 
to the original space, where 


oaj vA — 90 ) uU o 
— ao/ vV Bo 1/4 / Bo ao/ v P VBo 
zw x Bo sin u ) 


— yosin y COS LL — Qo Sin [4 


us sin u — ag COS H — bo cos u 
-3X an. ( 


1 Yo COS LL sin u + a cos u 
i= 
i sin jt + a9 cos i Bo cos fi 
9 = AK, 1— c vi 2 un as 
mai (1 — aĝ) cos fi/89 + 2ao/Bo — sin ñ — ao cos ji 


Immediately we have 


cos (u + An) + (ao + Ao) sin (u + An) 
! E m z 
= cos u + Qo sin p — » 5 AK, [sin u — ao cos u + sin fi + ao cos [i1] , 


m=1 


(Bo + AB) sin (u + Au) 


; i= m 
= Bo sinu — 5 5 AK, [—8o cos u + Bo cos ji] . 


m=1 


To the first order of Ak, we have 
1 

2 sin pL 
AB ol 

fo  2sing 


n 
5 AK, [Sin im + ao cos f] , 


m=1 


Aa = 


n 
5 AK m COS im, 
m=1 
and we remind ourselves of fim = H — 2¢o0m and AKm = (Akl) mm. It is 
clear that when the tune is close to the half-integer resonance, the size of the 
beam becomes larger and eventually goes to infinity as the tune approaches 
the half-integer. 
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11.3 Linear Coupling Resonance 


Linear coupling refers to mixing of linear motion between the horizontal 
and vertical planes. Coupling between transversal and longitudinal motion is 
present as well, but it is usually weaker. Linear coupling between transversal 
planes arises from solenoids, roll of quadrupoles, vertical misalignment and 
orbit offset at sextupole locations. In this section we study the effect of skew 
quadrupoles, which can be present intentionally or due to the roll of normal 
quadrupoles. Again, let us first assume that only one skew quadrupole is 
present, which is denoted as k,. Without loss of generality, we assume that 
this skew quadrupole is located at the end of a turn and that it is thin. As a 
result, the linear one turn map is 


X1 1 0 0 0 Xo 
ay | — 0 1 kl O} «4 ao 
Ui E 0 0 1 0 M Yo : 
bi kl 0 0 1 bo 
where 
COSI d- Or, Sin pus By Sin pua 0 0 
^ —Ye SM Hy ^ COSA; — Org Slnjus 0 0 
M= F i 
0 0 COSHy + cry SI ky By sin [by 
0 0 —Yy SİN Uy COSHy— QySİN Hy 


Applying the same coordinate transformation, we obtain the normalized co- 
ordinate system, which is 


a B Ox / V bz V Br 0 0 a 
yg) 0 0 1//By 0 y 
b 0 0 ay/V/By V/By/ Nb 
As a result, we have 
Tı 1000 COS Uy SÌN [Ly 0 0 Xo 
ài zb CERE 0 —sin fiz COS ia 0 0 do 
| [0010 0 0 — cosu, sinfly | | Vo |’ 
bi K001 0 0 —sinply cospy/ \ bo 


where K above denotes 


K = ksly/ Be Biss (11.3) 
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To shorten the equations, let us use again the following symbol 


COS [lz SİN Hy 0 0 
^ ur) — SİN Hy COS Ly 0 0 
R = ; 
[by 0 0 COS Uy SiN [ly 
0 0 —Sin fy COS py 


After n turns, the coordinates are 


n 


En 1000 Zp 
us icd FO) aT a do 
in} [0010 a To 
b, K001 bo 


To the first order, the coordinates are 


Tn 0000 
ün A 00 K0]. 
«| Poss E) 
Un Nuy 0000 Nuy 
bn K000 
0000 
A 00K 0]|. = 
+Ê R 4e 
@ 0000 0) 
K000 
0000 To 
E = 00k 0 a 
n-1)mu,/|0000 Hy Jo 
K000 bo 
0 0 0 0 
|» | Sf nita 0 0 cos(nj) sin(nuy) 
x io) d ME. 0 0 0 
cos(nju) sin(nju) 0 0 
00 . ds 
n-i]| 00 Run on 
+K eo ie 
2, ^ 00 Yo 
m= RLL 
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Ê= cos |(n — m) ua] sin (muy) — sin|(n — m) uz] sin (muy) 
cos [(n — m) pa] cos (mj) sin [(n — m) pa] cos (mpy) J 

T cos|(n — m) uy] sin (Muz) sin [(n — m) poy] sin (mp) 
UP | cos [(n — m) pty] cos (mps) | sin [(n — m) pry] cos (mus) J ` 


The nonzero terms in the transfer matrix can be further simplified using 
trigonometric relations. Therefore we have 


x cos [(n — ES cos [(n — m) fz] sin (My) 


M) [tz] Sin (muy) 


= Y {sin [nye — m (pa — i) - sin [nyis — m (pie + 1I) 
m=0 
B TN sin [n (uz — ny) /2] 
=5 {s | Uz 5 | 1) (us i) sin [(ux — uy) /2] 
wr | sin [n (Me + uy) /2] 
S | Hx 5 | 1) (Hz ' n) sin [Cie + Hy) /2! j 


= I {sin [z^ (Ha m) cos ls (Ha — i) luec 
n (Us + by IE JE n (e n) 


~ ty) 5] 5 (ie +m) RE ta) 


— sin sin [(us + fy) /2] 


+ cos [5n 
[n 

-oos sni 
E ; {sin [z^ 


si : 
agin lcs 
2 


cnn [een 


(Me + k cos ls (Ha — is) 


n (p — i) cos ls (us + is) 


When py > xz + 21 N (N is an integer), we obtain 


»» cos | (n—m 


)n] sin (muy) ^ 5 ( 


sin [n (te — 
sin [(u, — 
sin [n (ite + ny) /7] 
sin (us + Hy) /2] 


Hy) /2] 


Hy) /2] 


= sin (au) } 


1 
n — 1) sin (nuy). 
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Similarly, we have 


2» cos [(n — m)us] cos (mu) a5 cos [(n — m)us] cos (mj) — cos (nuz ) 


=o [betes om [Se] ect 
sin [n (Hx + ny) /2] \ 


+ cos [z^ (us — n) cos $ (He + m) sin [(H2 + uy) /2] 


— cos E (Ha + n) cos [z^ (Ha — p) ; 


so we obtain when py > cp; + 27N, 


sin [(n — 1) uy] 


1 
» cos [(n — m)us] cos (muy) > 5 fo — 1) cos (nuy) + B 
y 


Furthermore, we have 


n-1 


z sin |(n — m) uz] sin (Mpy) a sin [(n — m)uz] sin (muy) 


Aprender] hec 


sin [n (us + My) /2] 


sin [(us + My) /2] 


+ cos E (us — i») cos E (ua + n) 


and we obtain when pry > cus + 21 N, 


» sin [(n — m)us] sin (muy) > p fo Siege M \ 


Sin [ly 


Lastly, we have 


2 sin [(n — m) uz] cos (muy) m sin [(n — m)us] cos (mj) — sin (np) 


- fan [En + on [1 27] een 
sin [n (oe + My) /2] _ sin (ane) } ; 


- sin [z^ (us — i) cos E (us + i) sin [(us + fy) /2] 


and we obtain when jt, > cus + 22 N, 


z sin [(n — m)us] cos (muy) > + £l (n — 1) sin (npuy). 
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The upper right block of the matrix can be obtained by switching uy and 
fly Of the results above. In summary the transfer matrix at the limit of 
Ly — xps + 27N is given as 


z ^ 1. 1 ^ 
n- bienes So nan]. 


2 
where 
In Xo 
p ün k do 
Zn = ~ ; 20 = ~ ; 
Un Yo 
bn bo 


0 0 : 
COS (Nuz) Sin (Nuz) 0 0 
0 0 E Sy 
^ 0 0 s 0 
d m 0 ts, O0 0 |" 
s 0 0 0 
0 0 sin (Nur) F cos (Nus) 
"M 0 0 COS (Nus) + sin (nj) 
Macs sin (nuy) F cos (ni) 0 0 ] (a) 
cos (nuy) sin (nuy) 0 0 
and the coupling terms s; and s, above are 
2osu(n-Da] sin (n= 1) 
i Sin [Ly i bá sin Ju 


indicating that an arbitrarily small perturbation leads to arbitrarily large 
coupling between the horizontal and the vertical spaces. 

Similar to the half-integer resonance, the presence of the skew quadrupole 
component leads to a stop band gap in which the beam becomes unstable. 
To illustrate this point, let us first work the formalism of the eigenvalues of a 
4 x 4 symplectic matrix. 

Before we treat linear coupling, let us look at a few general properties of 
symplectic matrices and their eigenvalues. Recall that the symplectic condi- 
tion is 

MT JM = J, 
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where M is a 4x 4 real matrix. The eigenvalue of M can be obtained through 
solving 
det (st = AD =0. (11.5) 


Since M is real, 


MU = A? Mii = At Mt = Mg. 


So Ai is also an eigenvalue of M and 4f is the eigenvector. Keep in mind that 
det(AB) = det(A) det(B), | det(ÀT) = det(À). 
With these identities, we can transform eq. (11.5). 


det (xr = AD — ( — det J - det(J X — Af) = 0 => det(J MI — AJ) =0 
=> det MT . det(JM — AJ) = 0 => det(J — AMT J) = 0 

= det(f — AMT) = 0 — X" det(A1 f — MT) 20 

= det(M7 — A-1T) = 0. 


Therefore AT! is also an eigenvalue. Together with At, we reach the con- 
clusion that if M has an eigenvalue A then At, A71, Ai^! are also eigenvalues. 
We know that for M to be stable, |A| has to be smaller or equal to 1 for 
all eigenvalues of M. As a result, all eigenvalues of M lie on the unit circle. 
Apparently, we have, for this case, A = At! and \~! = Af. 

The next question we can ask is that supposing M is stable and every 
eigenvalue is reasonably far away from each other, what happens when M 
is perturbed? In terms of betatron motion, it means that uy and py are 
reasonably far away. 

To frame it in a more mathematical way, we say that there is a neighborhood 
around each eigenvalue so that there is only one eigenvalue in it. When M 
is perturbed its eigenvalues will move. They may all stay on the unit circle, 
or some of them may move away from it. Let us say one of them, A, moves 
away from the unit circle, then \'~! will also move away from it and be in 
the same neighborhood that A is in, making the total number of eigenvalues 
greater than 4, which is impossible. So the conclusion is that every one of 
them will stay on the unit circle (see Fig. 11.1). This is consistent with our 
experience, which is that when we change quadrupole strength, vz and jy 
change, but the motion is stable. 

The question that relates to linear coupling is what happens if two or more 
of the eigenvalues get close to each other? The answer is more complicated. 
It has been proven that when two colliding eigenvalues have the same sign of 
phase, motion remains stable after collision. Otherwise, instability may occur. 
In other words, difference resonance Hg — Hy does not lead to instability, sum 
resonance [tz + Hy does. 
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FIGURE 11.1: Distinct eigenvalues around the unit circle. Small circles 
show the neighborhood of each eigenvalue. 


Let us focus on coupling between horizontal and vertical planes. The matrix 
M is a 4 x 4 symplectic matrix, which we describe as 


^ AB 
M — ES A 
(6 5). 
where A, B, Ó and D are 2 x 2 matrices. From eq. (5.11), we have 
^ AC 
= A U 
w 


where A is defined as A = —JAT j, as well as B, C and D. Let us solve for 
the eigenvalue A of M + M~!. Obviously A = A + 1/A, because 
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we proceed and obtain 


EA : tÀA—A). 1 B+C 
det (I + M — AI) = det C m UN 
e D-A)-Î 

) 


= det [tr A —A)(tr D -AM — (tr À - A)f(Ó + B)((tr À - A) f) (B + c 
— det [(tr A — A) D - A)f-( E. + 
To find A, we have to solve the equation 


det [(tr A — A) D - A). i- (C+ BY c) - p. 


Now we have to find out what (Ô + B)(B + ©) is. 


necne t uc eq 
E cre. 
We have just shown that 
Ó-B-B«O 
'Then we obtain that 
(C+ B)(B + €) = det(Ó + B) f. 
So the equation for solving A becomes 
(tr À — A)(tr D — A) — det(Ó + B) = 0. 

'Then 

A? — (tr Â+ tr D)A + tr Â- tr D — det(Ó + B) = 0 

A=} rÂ tr D) a y EASED | det( + B). (11.6) 


This is the standard result of coupled motion, where the eigenvalue of the 
motion in one plane depends on the motion in the other plane and vice versa. 
The main features of eq. (11.6) are that A, — tr Â when tr Â > tr D and 
A, > tr D when tr Â « tr D; and that A... — tr D when tr Â > tr D and 
A_ — tr A when tr Â « tr D. 

Courant and Snyder [16] have shown that the sign of det(C 4- B) determines 
the stability of the motion when tr A~ tr D. Specifically, when [be — Hy = 27 N 
(difference resonance), det(C + B) is positive, A is real, A is on the unit circle, 
so motion is stable. When pr + fy = 27N (sum resonance), det(C + B) is 
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Ma 


Pa Hy 


Hy 


FIGURE 11.2: Crossing the difference resonance. Before: uy is constant 
and p, increases. After: pẹ and py reverse roles. Throughout the process, 
the ratio of the knobs (quadrupoles in synchrotrons) remains unchanged. 


negative, A can be complex, A moves away from the unit circle, so motion is 
unstable. 


Let us now look at the difference (us — Hy = 27N) and sum resonance 
(La + Hy = 21.N) separately. In case of difference resonance, we have 


(tr À — tr D)? 
4 


Ay, = e!» +e '!» = 2 COS Ug 


A, — Ay =2 + det(C + B), 


(tr A — tr D)? 
4 


tr À— tr D)? "PN 
— (cosi, — COS fly)” = rA + det(C + B). 


=> COS [lz — COS fly = + det(C + B) 


As a result, there is a minimum separation between the tunes when the 
difference becomes small, as shown in Fig. 11.2. The minimum separation of 
tunes can be used to determine the amount of coupling in an accelerator. 


In case of sum resonance, motion becomes unstable when 


tr À — te Dy? fe 
(r4 D < -ae(t +B), 
It is illuminating to compare the result above with that of eq. (11.4). There 
both sum and difference resonances show unbounded growth in coupling, yet 
only the sum resonance leads to instability in the 4D phase space. 


To illustrate this, let us follow Courant and Snyder [16] and work out the 
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one turn map when a skew quadrupole is present. 


10 0 0 COS [lz SİN [Ly 0 0 
p= 01k 0 — SİN [lz COS Hy 0 0 
0010 0 0 COS fly SiN [Ly 
K001 0 0 —sin fy COS [Ly 
COS [lz SİN Hg 0 0 


—sinu,  cosu, Kcospy K sin py 
0 0 COS fly sinfly | 


Kcosu, Ksinpig —sinfly COS py 
B-x( ME. J: cak NM. p 
COS [ly SİN Hy COS He SiN Hy 
and 


ĉ+B=x|( 0 0 )+( sin [ly oJ -«( sin [ly 0 ) 
COS Uy SİN Hr — COS fly 0 COS [iz — COS fy SiN [Lz 


Furthermore, we have 


Hence 


det(C + B) = K? sin pz sin Uy- 


For sum resonance, we have sin jz = — sin py and det(C+B) < 0. The motion 
is unstable. When there are many skew quadrupole components in a ring, to 
the leading order and near the sum resonance, we have 


det(Ó + B) = — sin? u Xo HE. 
m=1 
where we use the abbreviation Km = (kl) Boys as eq. (11.3). The 
stop band is 
(cos Hæ — COS fly)” < sin? u 5 RS 


m=1 


and the full width, to the leading order, is 


For difference resonance, we have sin jz, = sin and det(C + B) > 0. The 
motion is stable and the minimum tune difference is 
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11.4 Third—Integer Resonance 


In this section we deal with the third-integer resonance, which is generated 
by the sextupoles introduced into the ring to compensate for chromaticity. 
Similar to the previous section, we will start with a particle with an arbitrary 
position and angle and demonstrate the resonant behavior when the tune 
is close to the resonance. Again, let us first consider the case that a thin 
sextupole is located at the end of the ring. The one turn map is 


CORN || 


E T 3 cos u + asin u sinp To 
~ at kr? —ysinu cosu— asinu) \ aoj” 


In the normalized space 


TAE 2) (2): 


i 
= [9] 
G+ ks biR 


The position and angle after n turns are 


[e = mon E) (=) PRO SL 


To the first order of ks, we obtain 


n—1 
rE 


m=0 
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To cos (nu) + ào sin (n 0 
= Deos (ny) AE "m v ; 
— io sin (nu) + Go cos (nj) [£o cos (nu) + Go sin (np)| 


PEN Y | {Zp cos [(n — m) u] + Go sin [(n — m) u] P? sin (mp) 


(ao cos [(n — m) u] + Go sin [(n — m) u] Y? cos (mp) 


_ [ Tocos (np) + Go sin (np) š 0 
2 = sin (nu) + do cos (nu) dua | [žo cos (np) + ŭo sin (nu)]" 
a ly 88 (32 o2 SURE 
5 RB ( p 0) x (2 ud 
+ Lk,83 (22 — ad) > T n 
4° joe cos [ 


(2n — m) u] — sin [(2n — 3m) uj 
m=1 ( ) Hl 
PM o bs [(2n — 3m) u] — cos [(2n — m) 2 
m=1 [( l 


2n — m) u] + cos [(2n — 3m) u] 

l 3 

+ =k, B2 
Petom sin [(2n — m) u] + sin [(2n — 3m) u] 


2 


Again, using the formula 


2 cos (x + my) = OUTOS UY G ME 200800 (11.7) 
we obtain 
cos ( (mp) =F cosl pc (m-1)u]-— V^ cos (y+ my) 
_ cos [u + (n; —2)u/2]sin|(n — 1) u/2] _ cos (nui/2) sin [(m — 1) u/2] 
sin (u/2) sin (u/2) 
Y; cos [(2n — m) u] = S$ cos((2n — 1) u — (m — 1) u] 
u . cos (3nu/2) sin [(n — 1) u/2] 
zx 2. cos [2n — 1) u — mp] = —— —  sn(m/2  ' 
Xen [((2n — 3m) u -Xes [((2n — 3) u — 3 (m — 1) u] 


u . cos (nj/2) sin [3 (n — 1) u/2] 
= 3 cos [(2n — 3) u — 3my] = = ungue ee 
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From eq. (11.7), we have 


she +my) = Xe (x + my - z) 
_ cos [z+ (n — D y/2 — 1/2|sin (ny/2) _ sin [xz + (n — 1) y/2] sin (ny/2) 
in (y/2) sin (y/2) 
As a result, we obtain 
V sin (ip) (iu) =F snl w+(m—1) py) = Y ein (y+ my) 
E dia 2) sin [(n — 1) u/2] 
sin (4/2) | 
X sin [( (2n — m -S sink (2n — 1) u — (m — 1) u] 
E sin [On — Duc mu) = S nw/2) sin (n — 1) u/2] 
x 2. (2n - 1) u- mp] = > — (u/3j 
ET (2n — 3m) u =F snl [(2n — 3) u — 3 (m — 1) u] 


B sinn — id _ sin(nu/2)sin [3 (n — 1) u/2] 
-»,sn[on-3)&-3mj- A REP s 


The position and angle after n turns are 


T, |  ( Tocos (nu) + ao sin (np) 3 0 
& E E sin (nu) + do cos m) ha? m cos (nu) + Go sin -— 


E sin [(n — 1) n/2] [ sin (nu/2) 

A Tkp? (% a) sin (4/2) (2 rod 

1 sin [(n — 1) u/2] [ sin (3nu/2) 
EE Jb? (8$ — a) sin (11/2) E E 

1 sin [3 (n — 1) u/2] ( — sin (nu/2) 
tq TT — ( cos (ny /2) 
2 sk 8a som [((n — 1) u/2] [ — cos (3nu/2) 

° sin (u/2) sin (3nj/2) 


qub 8*3 sin [3 (n — 1) u/2] ( cos (nu/2) 
2^ "0  sin(3u/2) sin (nu/2) J? 


284 An Introduction to Beam Physics 


which shows that the position and angle become arbitrarily large when u — 
2nk or 2z(k X 1/3). 

Next, we will study the deformation of the invariant. Since the sextupole 
affect only the nonlinear part of the motion, the perturbation of the invariant 
is of the third order and higher. One way to obtain the new invariant is to find 
a new coordinate system in which the motion is a circle. The new invariant 
can be found via the relation between the new and the old coordinates. This 
method is called the normal form theory. In general, there is no analytical 
solution to the perturbed invariant. We can only obtain it perturbatively. In 
order to demonstrate the essence of the method, the lowest order perturbation 
of the invariant will be derived. Let 3 and à be the new coordinates such that, 
in this coordinate system, the motion is a circle up to the second order. Sines 
the linear motion in the coordinate system of (Z,@) is already a circle, the 
general form of the relation between the new and the old coordinate systems 
can be written as 


z ü ü Ai? + Aisdü + Azo? 

a) SAol p=]. 4+ m e pos 

a a a DB11z^ + Biota + Booa 
where A denote a second order transfer map that transforms (x,a). The 
inverse, to the second order, is 


From the relation 


OJ- heia) PaE] 


we obtain the one turn map in the new coordinates, which is 


n — Ao T , " MU "EM To . (11.8) 
a a + ksb? x? —rsiny + a cos u dp 


Expanding it to the second order, we have 


Tı EA cos u + Go sin u Ziz Zon 23 
~ = z. ~ + + + ; 
ay — To SIN u + Ao COS u 71a 22a 23a 
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where 
~2 
Ziz = Ai (5; cos? u + Todo sin (2u) + ŭo sin 
iz 
+ Ajo (-58 sin (24) + Todo cos (2u) + 5n sin 2n) 
~2 ~2 
+ Age (o sin? u — Todo sin (21) + ŭo cos *u), 
~2 ~2 
Zia = Buy (Zo cos? u + Todo sin (24) + ŭo sin i 
1c 
+ Bi» (iesin sin (2u) + Tao cos (2u) + =i sin (2n) 
~2 
+ Boo (3; sin? u — Todo sin (24) + ŭo cos? i) ; 
z2 BE z2 z2 aoe z2\ S 
Z9g =— (Anz + A123:9ü9 +r Ansty) COS [1 — (Bu + By2%9ao + Bačo) SIN LL, 


x2 Z z2N . x2 = = z2 
Zig = (Anz ae A1239d9 Ex Asa) Sin u— (But + By2%oao + Basti) COS LL, 


232 = 0, 
~2 
Z3q = kB? (5; cos? u + Todo sin (24) + à, sin 25 


Since the one turn map in the new coordinates is a rotation up to the second 
order, we obtain the following equations 


A11 cos? u — — sin (211) + A22 sin? u — A14 cos u — By, sin u = 0, 

A11 sin (2u) + A12 cos (2u) — As» sin (2u) — A1» cos i — B1» sin ui = 0, 

A11 sin? u + lAn sin (2u) + Ago cos? u — Avs cos u — Bag sin u = 0, 

Bıı cos? — IB sin(2u4) + Bog sin? u + Aj; sin u — By cos u = —k,B? cos” LL, 
By, sin(2u)+ B1» cos(2) — B22 sin(211) + A15 sin ji — B12 cos u = —k,B3sin(2y), 
By, sin? + 5 Bus sin(2u) + B22 cos? u + Ao» sin u — Bog cos u = —k,83 sin? pu. 


After tedious but straightforward algebraic and trigonometric manipulations, 
the solution is obtained, which is 


, eos? (1/2) 
sin (3u/2)' 


Bog = 0. 


3 cos (u/2) cos u 
Aii = —k, 62 ——————— Aj2=0, A22 = —k, 
s? 2sin(3u/2) ' 5 : i d 


153 3 COS (u/2) cos u 
By =—=k.B?, By = kp? — TEM 
11 7 p? 12 B? sin (31/2) 


As a result, the transformation defines a coordinate system in which the mo- 


tion is a rotation up to the second order. In other words, we have 
~2 x2 


x +4 =e, 
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which take the form 
(Z + An? + Aood?)” + (a+ Buk + Bia). =e 


in the old coordinates. Keeping only the terms up to the first order of k,, we 
have 
g? +07 + 2A? + 2B? + 2 (A22 + By) qa? =€. 
Since 
cos? (u/2) | cos(u/2) cos u 
AGS p Bock Ba A eee 
22 + Bu = kef? |— (3u/2) sin (31/2) 
cos (u/2) [cos u — cos? (u/2)] 


= kape sin (34/2) 
a cos (u/2) sin? (u/2) 3 sin (4/2) sin u 
Se sin (3/2) TU 2sin(3u/2) ' 


the perturbed invariant is 


a cg gi e a 4 a4 ee EE T 
or 
PHT- kb? C. [4 cos (u/2) + asin (u/2)] [T cos u + à sin u] = e. 
sin (34/2) 


Apparently the invariant diverge when u > 22/3. The invariant in the above 
equation is shown in Fig. 11.3, which shows the presence of the third-integer 
resonance. As a comparison, the true invariant obtained through tracking 
is shown in Fig. 11.4. It is clear that the invariant obtained from the first 
order perturbation theory agrees with the exact invariant qualitatively but 
not quantitatively. 

Like the half-integer resonance, we can go one step further to include the 
terms that are proportional to k?. Again, we can attempt to find another 
coordinate system in which the motion is a rotation up to k?. But first of all, 
we have to obtain the third order one turn map in the coordinates of G, à). 
From eq. (11.8), we have, to the third order, 


E £ x cos u + asin E 
kis E Ao : A H H oA- 1o i 
a1 a + k, 822? —x sin p + a cos u do 


z + A112? + Aga? x COS u + asin u 
= O 

a + Buz? + Bioxa —z sin u + a cos u + ks 8? (x cos u + asin p)? 
= z2 z2 z3 mare ie 
zo — A118, = A»2üg " 2 A? do + 2425 Bi Fao + Ao2 Bi2% dg 
Z za n erac eu ee = 
ao — Big — By2%0ao 2 A? Zao + 24253 B41T9ag + Ao2 Bi 2d 
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FIGURE 11.3: Invariant obtained through first order perturbation theory 
(left) and tracking (right), with k,9? = 1 and v = 130/360. 
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FIGURE 11.4: Invariant obtained through first order perturbation theory 
(left) and tracking (right), with k,9? = 1 and v = 110/360. 


The last line is the function A~! to the third order. It is straightforward to 
verify that it is true using the relation Byz = —2441. Yet Aand AT! are not 
symplectic to the third order, which is easily verified using eq. (5.6). The 
details of obtaining the symplectic version of Aand AT! are beyond the scope 
of this book. The result is actually very simple, which is 


E ~2~ zc 
2 2 20% pono" UC. 
E = zx Ac + Asa m i Ag Wr, A23 By Zao = A11 A2200 
c2 zc z J?’ 
a+ Bia? = 2A11xa + A? odo + A22By1 2a P A11 A224 
2 2 9 Z3 nes EEA 
A`! = X — Aur = A24 E Af xU A23 Bi Zao = A11 A2200 


mex EL ZA x3 
a— Bia? + 2A11xa + Ai Zao + A22By1 2a = Aj A224 


Again, it is straightforward to verify the symplecticity of the above map using 
eq. (5.6). Note that the third order part in Aand A“? is half of that in the 
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non-symplectic version of A7!. 
After a straightforward yet rather lengthy derivation, the final result is 


= = ONE x3 ~2~ DE m 
2 n oiu RR een n Aio + A112290 + A1z2%0d9 + A2220 
eV esos t ~3 on oe “Ag 
s —voSIn H + ao COSH) NBijizo + Biizt9ào + B1221909 + B2220 
where 
diy = gs OE soos 
111 s sin (3/2) i 
3 (n/2) cos u 
Api e uu e) oa 
112 32 2sin (31/2) [3 cos (24) js 
^ (u/2) sin (u/2) 
i= pp Oe, : 
122 = kB sin (31/2) [1 + 3cos (24)] , 
dis adigi eon uu econ (s 
: sin (34/2) , 
and 


By = 26° (m (u/2) sin? (u/2)cosu cos (u/2) cost e) 


sin (34/2) sin (34/2) 

(cos! (4/2) sin (1/2) 
sin (34/2) 

sin (u/2) cos? (u/2) cos? u \ 

sin (34/2) i 


[3 cos (24) — 1] 


sin? (11/2) cos? (u/2) cos? 3 
sin (34/2) : 
cos? (u/2)sinp cos? u cos (u/2) cos usin? ji 
sin (31/2) sin (31/2) 


The next step is to find a second order transformation (in terms of ks) that 
the third order map in the newest coordinates is a rotation. Unlike the first 
order transformation, not all nonlinear terms in the map can be removed even 
if u does not satisfy any resonance condition (i.e., v is irrational.). It is much 
easier to illustrate this in the eigenspace of the linear map, which is complex. 
Let us denote (st,s~) as the complex coordinates which are related to the 
real coordinates by the relations 


Bj; = k20? { [1 + 3cos (21)] 


+6 


cos? (1/2) cos u 


Biz; = k28? 
T il 2sin (34/2) 


+12 


Bo22 = E \. (11.10) 


(11.11) 


AT, 
v a 
4 | 
Nai a 
II 
Sl- 
N 
4 X 
=. e 
| ~ 
S: 3 
Ne 
aN 
Su RI 
Ne 
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E 1 1 1\ ls- 
GaC) e 


It is obvious, in this case, that the map in the complex coordinates is sym- 
plectic as long as that in the real coordinates is, since the determinant of the 
Jacobian in the complex coordinates equals that in the real coordinates, which 
equals to 1. The linear one turn map in the complex coordinates is 


DEJE JOE 233) v 


For the time being, let us consider the generic case that the map M contains a 
linear part R and a nonlinear part S. In the complex coordinates, R is shown 
above. Furthermore, let us assume that the lowest order terms in S are of 
that of m. Note that if m > 2, M is the map that has been transformed 
through the nonlinear normal form transformations at least once. Now let us 
consider M only up to the mth order, i.e., 


The inverse is 


Mm =R+ Sm- 
Define the coordinate transformation as 
Am — fa + Tm: 


where Z is the unity map and Tm contains terms of the mth order only. Hence, 
to the mth order, 
ds -1-— Tan: 


The normalized map, to the mth order, is 


Nm =m Am 0 Mm 0 A! =m (ET) 9 (R- S5) 9 (L— Ta) 
=m (Z +Tm) o (R+Sm— R o Tm) =m Rc Ss — (Tmo R =R o Tm). 


The goal is to cancel as many terms in Sm as possible. Let us evaluate the 
map Tm o R — R o Tm, which is 


Tm o R EE R o Tm 
m = mky QAm—k i 1 k pAm—k 
3 Tk (s ) (s ) S e "89 e e "s 6 Tnk (so) (so ) 
T* (s-)*( xut ei^ gt ei gt 4 -\k_4\m—k 
k=0 mk $ 0 Tmk (sc) (sd ) 
m E _\k m-—k ni RE 
E B (ss) (33) [ete 9 - e 
= _\k m-—k r iu(m— i : 
ico \Ting (80) (50) — [e^ 9 - e" 
Apparently terms in S,, cannot be removed if the corresponding terms in the 
map Tm o R — R o Tm are zero, and they are zero if 


II 


mk 


gii (m—2k) — etin — 0). 
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That is 


gii m-2kr1) Ez 


, 


which is fulfilled when 
ii (m — 2k x 1) = 2n, 


where n is an integer. Apparently, the solutions of this equation can be divided 
into two classes: one that is independent of u, which is k = (m x 1)/2, and 
the other that depends on u. The solutions that depend on p are called the 
resonance conditions, which can be avoided with the choice of u. The ones that 
are independent of u provide the terms that cannot be removed from the map 
regardless of the choice of the tune. First, let us consider the case of m — 2. 
Since both m and 2k are even, under no circumstance m — 2k + 1 = 0. This 
is consistent with the fact that a solution was found above that transforms 
the second order map into a linear map. For the u dependent solutions, it 
is straightforward to verify that all six solutions are included in one simple 
relation, which is 3u = 27n. Note that this is none other than the condition 
of the third-integer resonance. Second, we consider the case of m — 3. There 
are two solutions that are independent of u, which are k = 2 for the top row 
and k — 1 for the bottom row. Again, all eight solutions that are dependent 
on p are included in the expression 4u = 27, which is the condition of the 
fourth-integer resonance. In case of the sextupole, it drives the third-integer 
resonance to the first order of k, and the fourth-integer resonance to the 
second order of k,. Since we usually set the tune away from the third and 
the fourth-integer resonances, we can obtain a third order map that takes the 


form of " 
M= | e "sg + S35 (s ) (sj) 


ein st + St (55) (si) 


Let us focus on the new map, since we will not try to obtain the second order 
distortion of the invariant. Going one step further, we have 


N. [e H+ $3589 SQ | So 
s5 | fem 4 shes at] af) 


Transforming back to the real coordinates, we obtain 


ui 


"m -( To cos u + do ML Go + ay | (S5 + S31) o + i(S55 — | 
3= oes a —— —— € m. 
45 \=i(S52 — 331) To + (S32 + $31) do 
(11.14) 
As a result, we conclude that 83; -- S3; is real and $55 — 93, is purely imaginary. 
In order to make clear the meaning of the third order terms in M3, let us first 
show that M3 is symplectic to the third order. First recall that 


—& sin u + ŭo cos u 


N3 =3 Az o Az o Ma o A5! o Aj’. 
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Defining A23 =3 A3 o A2, we have 
N3 —3 A23 0 M2 o Azz 


When carrying out the transformations, we usually make sure that .4»3 is 
symplectic up to the third order. Therefore, we have 


Ax eA » Wc A s TS T ^ A s "A d eet ds 
Nz INT =. Áo MÀ (Ass Mog] ) => Ao Mod Î (âz) MT AT, 
=9 Âo MoJ MI A =2 Ao3 J Ad, =2 Î. 


Now that we have shown that M3 is symplectic up to the third order, we 
can find out the relations between the terms. The Jacobian of M3 can be 
written as 


Ñ; = Ê + 8s, 


—d T" Ex —42 
Ro ( 9) ua = (PTS, m) 
EE Sá (s0) S5soso 


The symplectic condition becomes 


where 


(£-- ôs) j j (RT + 8T) = J, 
which leads to the relation 
SJ RT + RJST =0. 
Plugging in the matrices R and $5, we have 
Sos 35) 0 1\ feci" 0 
Sh (si) Sásosb / \-1 OF \ 0 e" 
n [| 0 | 0 ) ex Sj ig E 
i 2 E 
sd ae Sza (so) $3180 80 
which can be simplified to 
m 0 1 
(S356 '^ + Sd; e") sy sd E i) = 0. 
Defining 
Ts = iSj5e'", 


we obtain 
S35 —iTse "H, ene iTg ei^ 
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Furthermore, we obtain that 
S39 + S]; — —2Tssinu, S3q — Sj, = —2iTs cos u. 
Hence we arrive at the conclusion that Ts is real. Therefore, we have 


Mie s [1 — iTssg sd | 1) 


etn [1 + iT 585 sa] st 


To the first order of Ts, N3 can be expressed as 


LN E [-i (u + Tss5 s0 )] so ) | 


exp [ a (u + Ts$9 89 | st 
It is clear by far that the remaining terms in the normalized map N3 contribute 
to the change of the tune only. It is worth noting that the change of the tune 
is a function of the invariant, which is sometimes called the tune shift with 
amplitude. Computationally, the above described procedure can be easily 


carried out using the Differential Algebraic (DA) technique, which is valid for 
any given order. 


Now the tune shift with amplitude can be determined through the relations 
between the coefficients in the real and the complex coordinates. Repeating 
eq. (11.13) to the third order, using eqs. (11.11) and (11.12), we have 


s \ dp i cosu sinp 11 So 
st) BAG — sinu cosp —i i 8l 
st) + A112 (so + st)’ (- RU 
j) + Bus (89 ts (—isg + is) 
H) (—isg + ist)” + A22 (—isy + isf)” 
.) 


(—isy + ish)” + B222 (—isy isi) 


3 


ll 
re aS 
o 
> ıl 
E 
Bo 
= 
Sas 
4 7X 
n 5 


S 


m: (Aui +iB111 
4\( 


"m teeth og 
Tt (Aui2 t iBi12 +89) (is 
Ani-iBii 


) X 
YE (Au2—iBuiz)( +f) (isp + isg ) 
4 “ys +iB122 ) (iso 4 isj)” ETE d 

4 KCAiz2 —iBi22)(5 ) (-isy +is{)”+(4222—iB222)(—is5 isd) 


It is on to extract the coefficients S35 and SÀ, which are 


0 
So 


WKS YH o— 
—_~—~ — 
Cn 
e| 
w 


se) 


[3 (A111 +iB111)— i (A112 +iB112)+ (A122 +7. Bi22) — 31 (A222 +iB222)], 


S == [3 (A111 —iBiii) +7 (A112 —iB112) + (A122 ^ (B122) 4-31 (A222 —iB222)). 


4 
1 
4 
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Furthermore, we have 
l 3 4 
4 (S32 + S31) 


i = 4 
4 (S35 = S41) 


From eqs. (11.9) and (11.10), we obtain 


due +\_ 38 42 Cos (u/2) sin u cos u 
1 (S32 + S31) = g ksb — dE. ^ 
E ay 3 cos (4/2) cos? u 
4 654) = 7 kB sin(3u/2) ` 


( 3411 + B112 + A122 + 3B222) , 


(—3B111 + A112 — B122 434222). 


As a result, eq. (11.14) becomes 


M= | Var inr) 


—Zo sin u + do cos u 
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where Aj = — (3/8) - K285 - (cos (1/2) cos / sin (34/2) : (8, + 35). Note that 
x2 
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Au is proportional to A + ŭo, which is an invariant of motion. Note that 
the distortion of the invariant of motion is proportional to k. Hence the tune 
shift is small compared to the distortion of the invariant. The result is that 
the third-integer resonance usually leads to arbitrary large distortion of the 
invariant. For higher order resonances, the distortion is either of the same 
order or smaller than the tune shift. The resonances are, therefore, confined 


in the phase space. 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


References 


L. W. Alvarez. Linear accelerator. US Patent 2,545,595, 1951. Filed 
1947. 


J. Arthur, P. Anfinrud, P. Audebert, et al. Linac Coherent Light Source 
(LCLS) conceptual design report. Technical Report SLAC-R-593, SLAC 
National Accelerator Laboratory, 2002. 


U. Bechstedt, J. Dietrich, R. Maier, et al. The cooler synchrotron COSY 
in Jülich. Nuclear Instruments and Methods, B113(1-4):26—29, 1996. 


M. Berz. Differential algebraic description of beam dynamics to very high 
orders. Technical Report SSC-152, also ON: DE90013777 and TRN: 90- 
023092, Lawrence Berkeley National Laboratory, SSC Central Design 
Group, 1988. OSTI ID: 6876262. Also Particle Accelerators, 24:109, 
1989. 


M. Berz. Modern Map Methods in Particle Beam Physics. Academic 
Press, San Diego, 1999. 


M. Berz, H. C. Hofmann, and H. Wollnik. COSY 5.0, the fifth order 
code for corpuscular optical systems. Nuclear Instruments and Methods, 
A258:402-406, 1987. 


M. Berz and K. Makino. COSY INFINITY. 
http://cosyinfinity.org, (accessed August 2014). 


I. G. Brown, editor. The Physics and Technology of Ion Sources. Wiley- 
VCH, Weinheim, second edition, 2004. 


K. L. Brown, F. Rothacker, D. C. Carey, and C. Iselin. TRANSPORT 
a computer program for designing charged particle beam transport sys- 
tems. Technical Report SLAC-91 Rev. 3, also CERN-80-04 and NAL-91, 
Stanford Linear Accelerator Center, Fermi National Accelerator Labo- 
ratory, CERN, 1983. 


P. J. Bryant and K. Johnsen. The Principles of Circular Accelerators 
and Storage Rings. Cambridge University Press, Cambridge, 1993. 


D. C. Carey. The Optics of Charged Particle Beams. Harwood Academic, 
New York, 1987. 


A. W. Chao. Physics of Collective Beam Instabilities in High Energy 
Accelerators. Wiley, New York, 1993. 


295 


296 


[13] 


[14] 


[15] 
[16] 


[17] 


[18] 
[19] 


[20] 


[21] 


[22] 


[23] 


24 


25 


26 


References 


A. W. Chao and M. Tigner, editors. Handbook of Accelerator Physics 
and Engineering. World Scientific, New Jersey, second edition, 1999. 


J. D. Cockcroft and E. T. S. Walton. Experiments with high velocity 
positive ions. — (I) Further developments in the method of obtaining 
high velocity positive ions. Proceedings of the Royal Society of London, 
A, 136:619-630, 1932. 


M. Conte and W. W. MacKay. An Introduction to the Physics of Particle 
Accelerators. World Scientific, New Jersey, second edition, 2008. 


E. D. Courant and H. S. Snyder. Theory of the alternating-gradient 
synchrotron. Annals of Physics, 3:1, 1958. 


K. R. Crandall, R. H. Stokes, and T. P. Wangler. RF quadrupole 
beam dynamics design studies. In R. L. Witkover, editor, Proceedings 
of the 1979 Linac Accelerator Conference, pages 205-216. Brookhaven 
National Laboratory, 1979. BNL-51134. 


R. J. Van de Graaff. Electrostatic generator. US Patent 1,991,236, 1935. 
Filed 1931. 


R. J. Van de Graaff. Tandem electrostatic accelerators. Nuclear Instru- 
ments and Methods, 8:195—202, 1960. 


R. J. Van de Graaff, K. T. Compton, and L. C. Van Atta. The elec- 
trostatic production of high voltage for nuclear investigations. Physical 
Review, 43(3):149-157, 1933. 


L. Deniau, F. Schmidt, C. Iselin, et al. MAD — Methodical Accelerator 
Design. 
http://mad.web.cern.ch/mad/, (accessed August 2014). 


A. J. Dragt. MaryLie code and MaryLie manual information and down- 
load page. 

http://www.physics.umd.edu/dsat /dsatmarylie.html, (accessed August 
2014). 


A. J. Dragt, L. M. Healy, F. Neri, and R. Ryne. MARYLIE 3.0 -a 
program for nonlinear analysis of accelerators and beamlines. IEEE 
Transactions on Nuclear Science, NS-3,5:2311, 1985. 


D. A. Edwards and M. J. Syphers. An Introduction to the Physics of 
High Energy Accelerators. Wiley, New York, 1993. 


D. A. Edwards and L. C. Teng. Parametrization of linear coupled motion 
in periodic systems. IEEE Transactions Nuclear Science, 20(3):885-888, 
1973. 


D. Einfeld, J. Schaper, and M. Plesko. Design of a diffraction limited 
light source (DIFL). In Proceedings of the 1995 Particle Accelerator 
Conference, volume 1, pages 177-179. IEEE, 1996. 


27 


28 


29 


30 


31 


[32] 


[33] 


[34] 


35 


36 


37 


38 


39 


[40] 


References 297 


E. Forest. Beam Dynamics: A New Attitude and Framework. Harwood 
Academic Publishers, Amsterdam, 1998. 


R. Geller. New high intensity ion source with very low extraction voltage. 
Applied Physics Letters, 16(10):401—404, 1970. 


H. Goldstein. Classical Mechanics. Addison-Wesley, Reading, MA, sec- 
ond edition, 1980. 


P. W. Hawkes and E. Kasper. Principles of Electron Optics, volume 1-3. 
Academic Press, London, 1996. 


J. Ishikawa. Negative ion sources. In I. G. Brown, editor, The Physics 
and Technology of Ion Sources, pages 285-310. Wiley-VCH, Weinheim, 
second edition, 2005. 

http:/ /onlinelibrary.wiley.com/doi/10.1002/3527603956.ch14/summary. 


G. H. Jansen. Coulomb Interactions in Particle Beams. Advances in 
Electronics and Electron Physics: Supplement 21. Academic Press, New 
York, 1990. 


D. Johnson. Private communication for the lattice data of the FODO cell 
of the Fermilab Main Injector at Fermi National Accelerator Laboratory. 


L. W. Jones and K. M. Terwilliger. A small model fixed field alternating 
gradient radial sector accelerator. In E. Regenstreif, editor, Proceedings 
of CERN Symposium on High Energy Accelerators and Pion Physics, 
pages 359-365. CERN, European Organization for Nuclear Research, 
1956. CERN 56-25, Volume 1. 


S. P. Kapitza. The Microtron. Accelerators & Storage Rings, 1. Harwood 
Academic, London, 1978. Originally published by I. Nauka, Moscow, 
1969. 


M. Kauderer. Symplectic Matrices: First Order Systems and Special 
Relativity. World Scientific Publishing Co., Singapore, 1994. 


D. W. Kerst. The acceleration of electrons by magnetic induction. Phys- 
ical Review, 60:47—53, 1941. 
https://journals.aps.org/pr/abstract/10.1103/PhysRev.60.47. 


D. W. Kerst. Magnetic induction accelerator. US Patent 2,335,014, 
1943. Filed 1942. 


O. L. Krivanek, N. Dellby, and M. F. Murfitt. Aberration correction in 
electron microscopy. In J. Orloff, editor, Handbook of Charged Particle 
Optics, pages 601—640. CRC Press, Taylor & Francis Group, London, 
second edition, 2009. 


E. O. Lawrence. Method and apparatus for the acceleration of ions. US 
Patent 1,948,384, 1934. Filed 1932. 


298 


[41] 


[42] 


[43] 


[44] 


[45] 


[46] 


47 


48 


49 


50 


5l 


52 


53 


[54] 


References 


J. D. Lawson. The Physics of Charged-Particle Beams. Clarendon Press, 
Oxford, 1988. 


LBNL. 1-2GeV synchrotron radiation source, conceptual design report. 
Technical Report LBNL PUB-5172 Rev., Lawrence Berkeley National 
Laboratory, 1986. 


S. Y. Lee. Accelerator Physics. World Scientific, New Jersey, second 
edition, 2004. 


M. P. Level, P. C. Marin, P. Nghiem, E. M. Sommer, and H. Zyngier. 
Progress report on Super-ACO. In E. R. Lindstrom and L. S. Taylor, 
editors, Proceedings of the 1987 IEEE Particle Accelerator Conference, 
volume 1, pages 470-472. National Bureau of Standards, Los Alamos 
National Laboratory, 1987. OSTI ID: 5125784, CONF-870302- Vol.1. 


M. S. Livingston and J. P. Blewett. Particle Accelerators. McGraw-Hill, 
New York, 1962. 


E. J. Lofgren. Bevatron operational experiences. In E. Regenstreif, 
editor, Proceedings of CERN Symposium on High Energy Accelerators 
and Pion Physics, pages 496-503. CERN, European Organization for 
Nuclear Research, 1956. CERN 56-25, Volume 1. 


J. M. J. Madey. Stimulated emission of bremsstrahlung in a periodic 
magnetic field. Journal of Applied Physics, 42(5):1906-1913, 1971. 


K. Makino. Rigorous Analysis of Nonlinear Motion in Particle Acceler- 
ators. PhD thesis, Michigan State University, 1998. 


K. Makino and M. Berz. Perturbative equations of motion and dif- 
ferential operators in nonplanar curvilinear coordinates. International 
Journal of Applied Mathematics, 3,4:421—440, 2000. 


K. Makino and M. Berz. COSY INFINITY version 9. Nuclear Instru- 
ments and Methods, A558:346—350, 2006. 


T. Matsuo, H. Matsuda, Y. Fujita, and H. Wollnik. Computer program 
'TRIO for third order calculation of ion trajectory. Mass Spectroscopy, 
24:19-61, 1976. 


L. Michelotti. Intermediate Classical Dynamics with Applications to 
Beam Physics. Wiley, New York, 1995. 


M. Nishiguchi and M. Toyoda. Computer program TRIO 2.0 for calcu- 
lation and visualization of ion trajectories. Physics Procedia, 1:325-332, 
2008. 


H. Nishimura. Private communication for the lattice data of the Booster 
to Storage Ring beam transfer line at the Advanced Light Source at 
Lawrence Berkeley National Laboratory. 


55 


56 


57 


58 


59 


60 


61 


62 


63 


64 


65 


66 


67 
68 


[69 


[70] 


References 299 


J. Orloff, editor. Handbook of Charged Particle Optics. CRC Press, 
Taylor & Francis Group, London, second edition, 2009. 


J. Picht. Beitrage zur Theorie der geometrischen Elektronenoptik. 
Annalen der Physik, 407(8):926—964, 1932. 


J. R. Pierce. Rectilinear electron flow in beams. Journal of Applied 
Physics, 11:548—554, 1940. 


M. Reiser. Theory and Design of Charged Particle Beams. Wiley-VCH, 
Weinheim, second edition, 2008. 


H. Rose. Historical aspects of aberration correction. Journal of Electron 
Microscopy, 58(3):77-85, 2009. 


H. Rose. Geometrical Charged-Particle Optics. Springer-Verlag, Berlin, 
second edition, 2012. 


Y. Sasaki and S. Maruse. Über die Arbeitsweise und die elektronenop- 
tischen Eigenschaften der Spitzenkathode. In G. Móllenstedt, H. Niehrs, 
and E. Ruska, editors, Physikalisch- Technischer Teil, Band 1, pages 9- 
13. Springer-Verlag, 1960. Fourth International Conference on Electron 
Microscopy, Berlin 1958. 


O. Scherzer. Über einige Fehler von Elektronenlinsen. Zeitschrift für 
Physik, 101(9-10):593-603, 1936. 


A. Septier, editor. Applied Charged Particle Optics. Academic Press, 
New York, 1980, 1983. Part A, B, C. 


D. H. Sloan and E. O. Lawrence. Production of heavy high speed ions 
without the use of high voltages. Physical Review, 38:2021—2032, 1931. 
https://journals.aps.org/pr/abstract/10.1103/PhysRev.38.2021. 


C. Steier. Private communication for the lattice data of the Advanced 
Light Source at Lawrence Berkeley National Laboratory. 


V. Suller. Private communication for the lattice data of the storage at 
Center for Advanced Microstructures and Devices at Louisiana State 
University. 


M. Szilagyi. Electron and Ion Optics. Plenum Press, New York, 1988. 


R. M. Tromp, J. B. Hannon, A. W. Ellis, W. Wan, A. Berghaus, and 
O. Schaff. A new aberration-corrected, energy-filtered LEEM/PEEM in- 
strument. I. Principles and design. Ultramicroscopy, 110:852-861, 2010. 


V. Veksler. A new method of acceleration of relativistic particles. Journal 
of Physics (Moscow), 9(3):153, 1945. 


T. P. Wangler. RF Linear Accelerators. Wiley-VCH, Weinheim, second 
edition, 2008. 


300 


71 


72 


73 


74 


75 


76 


TT 
78 


79 


[80] 


[81] 


References 


H. Weick. GICOSY - based on COSY 5.0 with additions done later in 
Giessen. 
http://web-docs.gsi.de/ weick/gicosy/, (accessed August 2014). 


H. Weick. GIOS - General Ion Optical System. 
http://web-docs.gsi.de/ weick/gios/, (accessed August 2014). 


R. Wideróe. Über ein neues Prinzip zur Herstellung hoher Spannungen. 
Archiv für Elektrotechnik, 21:387, 1928. 


H. Wiedemann. Particle Accelerator Physics. Springer, Berlin, third 
edition, 2007. 


K. Wille. The Physics of Particle Accelerators: An Introduction. Oxford 
University Press, New York, 2000. In English, orignal in German, 1996. 


E. J. N. Wilson. An Introduction to Particle Accelerators. Oxford Uni- 
versity Press, New York, 2001. 


B. Wolf, editor. Handbook of Ion Sources. CRC Press, New York, 1995. 
H. Wollnik. Optics of Charged Particles. Academic Press, Orlando, 1987. 


H. Wollnik and M. Berz. Relations between the elements of transfer 
matrices due to the condition of symplecticity. Nuclear Instruments and 
Methods, A238:127-140, 1985. 


H. Wollnik, B. Hartmann, and M. Berz. Principles of GIOS and COSY. 
AIP Conference Proceedings, 177:74, 1988. 


M. Yavor. Optics of Charged Particle Analyzers, volume 157 of Advances 
in Imaging and Electron Physics. Academic Press, San Diego, 2009. 


Index 


Aberration, 36, 115, 116, 167 
Chromatic, 163, 178, 235 
Derivation, 107 
Electron Microscope, 177, 178, 

183 
Integral Form, 110 


Order-by-Order Computation, 109 


Spectrograph, 170 
Spherical, 169, 177 
Acceleration, 4, 11 
Accelerator 
Mass Spectrometer, 164 
Physics, 4 
Acceptance, 164 
Achromat 
Double-Bend (DBA), 226, 229- 
231, 234 
Multiple-Bend (MBA), 230, 233 


Triple-Bend (TBA), 226, 230-232, 


234 
Achromaticity, 236 
Adiabatic Damping, 212 
Advanced Light Source (ALS), 29, 
30, 149, 232, 256 
Alpha, see Twiss Parameter 
ALS, see Advanced Light Source 
Alternate Gradient, 207 
Focusing, 25 
Synchrotron, 28 
Alvarez, L. W., 18 
Analytic 
Complex Variables, 120 
Angular Momentum, 105 
Anode, 5 
Antiproton Source, 4 
Arc Length, 32, 60, 61 
Astrodynamics, 1 


B Rho, 20 
Ballistic, 7 
Bunching, 240 
Barber's Rule, 166 
Barrel Distortion, 163 
Beam, 10 
Current, 15 
Ellipse, 194 
Optics, 31 
Periodic Transport, 189 
Production, 4 
Beating, 159, 198 
Bending Magnet, see Dipole 
Bessel Function, 242 
Beta, see Twiss Parameter 
Betatron, 19, 21, 207 
Condition, 22 
Bevatron, 26 
Binoculars, 163 
BMDO, see Strategic Defense 
Initiative 
BNL, see Brookhaven National 
Laboratory 
Bp, see B Rho 
Brightness, 7 
Brookhaven National Laboratory 
(BNL), 28 
Browne-Buechner Spectrograph, 166 
Bunch Compressor, 236, 240 
Buncher, 240 


Camera, 41, 162 

Capture, 9 

Carbon Foil, 8 

Cathode, 4, 5 
Ray Tube, 161 

Cavity, 22, 29, 212, 232, 240, 241 
Electromagnetic Field, 241 


302 


Field Distribution, 241 
Longitudinal Dynamics, 252 
TMo10, 242 
TMi10, 243 
Transverse 
Dynamics, 257 
Focusing, 259, 260 
Co, see Aberration, Chromatic 
CEBAF, see Continuous Electron 
Beam Accelerating Facility 
Centroid, 263 
CERN, see European Organization 
for Nuclear Research 
Cesium, 7, 9 
Cesiation, 7 
Charge, 1 
Chicane, 236, 240 
Child-Langmuir Law, 5 
Chromatic Aberration, see 
Aberration, Chromatic 
Chromaticity, 200, 213, 268, 281 
Correction, 204 
Natural, 202 
Closed Orbit, 232, 263 
Cockcroft, J. D., 12, 13 
Cockcroft- Walton, 12, 13 
Cold Field Emission Gun (CFEG), 7 
Collider, 19, 26, 28-30 
Coma, 163 
Combination Systems, 48 
Complex Coordinates 
Rotational Symmetry, 119 
Continuous 
Electron Beam Accelerating 
Facility (CEBAF), 23 
Rotational Symmetry, 120 
Wave, 15, 22 
Coordinates 
Curvilinear, 58, 60, 62 
Cylindrical, 50 
Particle Optical, 34, 62 
Cosine-like Ray, 178, 179 
COSY Ring, 27, 28, 30 
Coupling, 157 
Resonance, 271 


Index, 


CRT, see Cathode Ray Tube 

Cg, see Aberration, Spherical 

Curvilinear Coordinates, see 
Coordinates, Curvilinear 

CW, see Continuous, Wave 

Cyclotron, 23, 24 

Cylindrical Deflector, see 
Deflector, Cylindrical 


DA, see Differential Algebra 
Damping, 212 
Rate, 232 
DBA, see Achromat, Double-Bend 
Deflector 
Cylindrical, 86 
Electrostatic, 83 
Spherical, 87 
Transfer Matrix, 86 
Defocusing Lens, 40 
Delta Function, 78, 91, 99 
Determinism, 35 
DFELL, see Duke Free Electron Laser 
Laboratory 
Difference Resonance, see 
Resonance, Difference 
Differential Algebra (DA), 111, 115, 
132, 159, 199, 200, 205, 249, 
292 
114, 129 
n. Dy, 131 
Arithmetic, 129, 131 
Concatenation, 137 
COSY INFINITY, 134 
Derivatives, 130 
Functions, 133 
Map 
Composition, 137 
Computation, 134 
Inversion, 138 
Manipulation, 137 
Numerical Integration, 136 
Reversion, 139 
Variable 
Multiple, 131 
Single, 129 


Index 303 


Dipole, 73, 143, 210, 236 Moment, 1 
Edge, 76 Quadrupole, 70 
Error, 261 Rigidity, 64 
Rectangular, 79 Electron 
Sector, 76 Capture, 9 
Transfer Matrix, 76, 81, 83 Cyclotron Resonance (ECR) 
Dirac Delta Function, see Heating, 10 
Delta Function Ion Source, 9-11 
Discrete Rotational Symmetry, 120 Microscope, 57, 98, 162 
Dispersion, 165, 168, 198, 224, 227 Low Energy (LEEM), 8, 176, 
Periodic Solution, 198 177, 183, 185, 187 
Suppressor, 224 Photo Emission (PEEM), 176, 
DLD, see Drift-Lens-Drift System 177, 183, 185-187 
Double Midplane Symmetry, see Scanning (SEM), 176-179, 183 
Symmetry, Double Midplane Scanning Transmission (STEM), 
Double-Bend Achromat, see 176, 177, 179, 181 
Achromat, Double-Bend TEAM Corrector, 183 
Doublet, 178, 229, 230 TEAM Project, 176, 182 
Dresden High Magnetic Field Transmission (TEM), 7, 176, 
Laboratory (HLD), 26 177, 180-183 
Drift, 37, 69, 142 Transmission, Aberration - cor- 
Drift-Lens-Drift (DLD) System, 45- rected (TEAM), 181, 182 
47 Source, 4 
Driving Term, 110 Volt, 4 
Duke Free Electron Laser Laboratory Electrostatic 
(DFELL), 215 Deflector, 83 
Dynamic Aperture, 206 Transfer Matrix, 86 
Lens, 177, 183 
ECR Ion Source, see Mirror, 184 
Electron, Cyclotron Round Lens, 89 
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see Dresden High Magnetic 
Field Laboratory 

Homogeneous Dipole, 73 

Hyperbola, 191 


ILC, see International Linear Collider 
Imaging, 161 
System, 44 
Independent Variable 
Arc Length, 61 
Induction Stovetop, 20 
Inert Gas, 14 
Inhomogeneous 
Deflector, 83 
Transfer Matrix, 86 
Sector Magnet, 82 
Transfer Matrix, 83 
Injection, 8 
Integer Resonance, see 
Resonance, Integer 
Integrability, 206 
Interaction Point, 235 
International Linear Collider (ILC), 
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